Home
Publications
News
Projects
Teaching
Talks
Quotes
Posts
CV
Light
Dark
Automatic
Hate Speech
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection
Transformer-based Natural Language Processing models have become the standard for hate speech detection. However, the unconscious use …
Giuseppe Attanasio
,
Debora Nozza
,
Eliana Pastor
,
Dirk Hovy
PDF
Cite
Code
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists
Natural Language Processing (NLP) models risk overfitting to specific terms in the training data, thereby reducing their performance, …
Giuseppe Attanasio
,
Debora Nozza
,
Dirk Hovy
,
Elena Baralis
PDF
Cite
Code
Cite
×