UoR at SemEval-2021 task 7: utilizing pre-trained DistilBert model and multi-scale CNN for humor detection

Liu, Zehao; Haines, Carl; Liang, Huizhi

Download

Preview

Text (Open Access)
- Published Version
· Available under License Creative Commons Attribution.

Text
- Accepted Version
· Restricted to Repository staff only

Advice

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Tools

Lists

Liu, Z., Haines, C. and Liang, H. (2021) UoR at SemEval-2021 task 7: utilizing pre-trained DistilBert model and multi-scale CNN for humor detection. In: SemEval-2021, 5-6 August 2021, Bangkok.

Abstract/Summary

Humor detection is an interesting but difficult task in NLP. Humor might not be obvious in text because it may be embedded into context, hide behind the literal meaning of the phrase and require prior knowledge to understand. We explored different shallow and deep methods to create a humour detection classifier for task 7-1a. Models like Logistic Regression, LSTM, MLP, CNN were used, and pre-trained models like DistilBert were introduced to generate accurate vector representation for textual data. We focused on applying a multi-scale strategy on modelling, and compared different models. Our best model is the DistilBert+MultiScale CNN which used different sizes of CNN kernel to get multiple scales of features. This method achieved 93.7% F1-score and 92.1% accuracy on the test set.

Item Type	Conference or Workshop Item (Paper)
URI	https://centaur.reading.ac.uk/id/eprint/97213
Refereed	Yes
Divisions	Science > School of Mathematical, Physical and Computational Sciences > Department of Computer Science
Download/View statistics	View download statistics for this item

Download Statistics

Downloads

Downloads per month over past year

Related URLs

Deposit Details

CORE (COnnecting REpositories)

University Staff: Request a correction | Centaur Editors: Update this record

Date Deposited:	08 Apr 2021 09:52	Date item deposited into CentAUR
Last Modified:	28 Apr 2022 08:45	Date item last modified