Humor-spa-Chiruzzo
Task Identifier: humor-2021-chiruzzo-spa
Cluster: Humor Detection
Data Type: spa
Score Metric: Macro-F1
Paper/GitHub/Website URL:


Rank  Submission Title                Model                            Score
1     Finetuned LMs                   InfoDCL                          87.664
2     Finetuned LMs                   Bernice                          87.1976
3     Finetuned LMs                   XLM-Twitter                      86.732
4     Finetuned LMs                   TwHIN-BERT                       86.1322
5     Finetuned LMs                   XLM-RoBERTa-Large                85.9996
6     Finetuned LMs                   mBERT                            85.4633
7     Finetuned LMs                   XLM-RoBERTa-Base                 84.4496
8     Zero-shot                       ChatGPT with translated prompts  68.8015
9     Five-shot in-context learning   Vicuna-7B                        68.4788
10    Zero-shot                       ChatGPT                          68.2799
11    Three-shot in-context learning  Vicuna-7B                        60.5013
12    Three-shot in-context learning  LLaMA-7B                         55.8854
13    Baseline                        Random                           54.578
14    Five-shot in-context learning   LLaMA-7B                         51.9088
15    Zero-shot                       Vicuna-7B                        50.0998
16    Zero-shot                       BLOOM-7B                         48.2475
17    Zero-shot                       Bactrian-LLaMA-7B                46.1309
18    Three-shot in-context learning  mT5-XL                           41.5381
19    Five-shot in-context learning   mT5-XL                           40
20    Zero-shot                       Bactrian-BLOOM                   39.3923
21    Zero-shot                       LLaMA-7B                         38.5635
22    Five-shot in-context learning   mT0-XL                           38.4552
23    Three-shot in-context learning  mT0-XL                           37.4926
24    Five-shot in-context learning   BLOOM-7B                         35.047
25    Zero-shot                       Alpaca-7B                        34.555
26    Three-shot in-context learning  BLOOM-7B                         34.339
27    Zero-shot                       mT5-XL                           34.2751
28    Baseline                        Majority                         33.3333
29    Three-shot in-context learning  BLOOMZ-P3-7B                     32.0652
30    Zero-shot                       BLOOMZ-7B                        32.0652
31    Five-shot in-context learning   BLOOMZ-P3-7B                     32.0652
32    Zero-shot                       mT0-XL                           32.0652
33    Zero-shot                       BLOOMZ-P3-7B                     31.9728
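
The scores above are Macro-F1 values expressed as percentages. As a rough illustration of the metric (not the leaderboard's own evaluation script), the sketch below computes Macro-F1 as the unweighted mean of per-class F1 scores. The function name `macro_f1` and the 0/1 label encoding are assumptions made for this example.

```python
from typing import Hashable, Sequence


def macro_f1(y_true: Sequence[Hashable], y_pred: Sequence[Hashable]) -> float:
    """Return Macro-F1 as a percentage: the unweighted mean of per-class F1.

    Hypothetical helper for illustration; classes are taken from the union
    of the gold and predicted labels.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    # dict.fromkeys keeps first-seen order and removes duplicates.
    labels = list(dict.fromkeys(list(y_true) + list(y_pred)))
    if not labels:
        raise ValueError("at least one example is required")

    f1_scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs.
        f1_scores.append(2 * tp / denom if denom else 0.0)

    return 100.0 * sum(f1_scores) / len(f1_scores)


# Assumed toy data: a balanced binary set (1 = humorous, 0 = not humorous)
# scored against a predictor that always outputs the same class.
gold = [1, 0, 1, 0]
always_zero = [0, 0, 0, 0]
print(round(macro_f1(gold, always_zero), 4))
```

On this toy balanced set, the constant predictor gets F1 = 2/3 on the predicted class and 0 on the other, so its Macro-F1 is one third. That matches the Baseline Majority score of 33.3333 only under the assumption that the test set is roughly balanced, which the page itself does not state.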