Sentiment-bug-Winata
Task Identifier: sentiment-2022-winata-bug
Cluster: Sentiment Analysis
Data Type: bug
Score Metric: Macro-F1
Paper/GitHub/Website URL:


Rank  Submission Title                Model                            Score
1     Finetuned LMs                   mBERT                            74.5701
2     Finetuned LMs                   Bernice                          73.8462
3     Finetuned LMs                   XLM-Twitter                      71.6028
4     Finetuned LMs                   InfoDCL                          71.5453
5     Finetuned LMs                   TwHIN-BERT                       70.7888
6     Finetuned LMs                   XLM-RoBERTa-Base                 67.7155
7     Finetuned LMs                   XLM-RoBERTa-Large                67.4844
8     Zero-shot                       ChatGPT                          34.634
9     Zero-shot                       BLOOMZ-P3-7B                     34.6032
10    three-shot in-context learning  LLaMA-7B                         34.2855
11    five-shot in-context learning   LLaMA-7B                         33.6372
12    three-shot in-context learning  BLOOM-7B                         32.4471
13    Zero-shot                       ChatGPT with translated prompts  31.1874
14    Baseline                        Random                           30.7686
15    five-shot in-context learning   BLOOM-7B                         30.2298
16    three-shot in-context learning  BLOOMZ-P3-7B                     29.6181
17    five-shot in-context learning   BLOOMZ-P3-7B                     29.3402
18    three-shot in-context learning  mT0-XL                           25.9206
19    three-shot in-context learning  mT5-XL                           22.0791
20    Zero-shot                       Bactrian-BLOOM                   20.9184
21    Zero-shot                       BLOOMZ-7B                        20.6918
22    five-shot in-context learning   mT5-XL                           20.6502
23    Zero-shot                       LLaMA-7B                         19.5948
24    Zero-shot                       BLOOM-7B                         19.0365
25    five-shot in-context learning   mT0-XL                           18.9973
26    Baseline                        Majority                         18.4448
27    Zero-shot                       Alpaca-7B                        18.4448
28    Zero-shot                       mT5-XL                           18.3908
29    Zero-shot                       mT0-XL                           18.2698
30    five-shot in-context learning   Vicuna-7B                        17.7115
31    Zero-shot                       Bactrian-LLaMA-7B                16.016
32    three-shot in-context learning  Vicuna-7B                        13.7427
33    Zero-shot                       Vicuna-7B                        12.9032
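
The header names Macro-F1 as the score metric. As a rough illustration of what that metric measures, the sketch below computes the unweighted mean of per-class F1 scores and applies it to a majority-class predictor. The function name `macro_f1`, the toy labels, and the scaling to a 0-100 range are assumptions made for this example; the leaderboard's actual evaluation script is not shown here and may differ in detail.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, scaled to 0-100 (scaling is an assumption)."""
    # Score every label that appears in either the gold labels or the predictions.
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); a class with no hits and no errors scores 0.
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return 100.0 * sum(f1_scores) / len(f1_scores)


# Hypothetical toy data, not taken from the benchmark.
y_true = ["neg", "neg", "neg", "neu", "pos", "pos"]

# A majority baseline predicts the most frequent gold label for every example.
majority_label = Counter(y_true).most_common(1)[0][0]
y_majority = [majority_label] * len(y_true)

print(round(macro_f1(y_true, y_majority), 4))  # 22.2222
print(macro_f1(y_true, y_true))                # 100.0
```

Because every class counts equally, a predictor that only ever outputs one label earns credit for that class alone, which is why a majority baseline can land well below a random one under this metric.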