AFINN

Sentiment ratings for common emotion and attitude words 🗣️, including internet slang.

Summary

Composition:

Approximately 3.4k entries, mostly unigrams (plus 30 bigrams, such as “fed up”)
Words representing attitudes and emotions
Multi-class integer labels from -5 (very negative) to +5 (very positive)

Creation Methodology:

From a variety of public domain sources, the author collected obscene words, affective words, emotion words, and internet slang
Manual annotation by the author starting in 2009
Refinements and additions over multiple versions

Evaluation: Nielsen (2011) compared AFINN against four other sentiment dictionaries on a dataset of 1,000 tweets from the Pulse of a Nation study, where each tweet was manually labelled for sentiment strength by 10 independent raters.

AFINN achieved the highest Pearson correlation with human ratings (0.564), outperforming comparable lexicons including Harvard GI (0.374), OpinionFinder (0.458), and ANEW (0.525).
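The comparison above rests on the Pearson correlation between lexicon-derived scores and human ratings. The sketch below shows how such a coefficient can be computed in plain Python. The function name `pearson` and the toy numbers are assumptions made for illustration; they are not data from Nielsen (2011).

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Covariance and variances (unnormalised; the 1/n factors cancel out).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: one lexicon score and one mean human rating per text.
lexicon_scores = [3, -2, 0, 4, -5]
human_ratings = [2.5, -1.0, 0.5, 3.0, -4.0]

r = pearson(lexicon_scores, human_ratings)
print(f"Pearson r = {r:.3f}")
```

A value near 1 means the lexicon ranks texts much as the raters do; the reported 0.564 for AFINN indicates a moderate positive agreement.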

Usage Guidance: General-purpose lexicon suitable for getting started with sentiment analysis. It can be paired with domain-specific lexicons. Access via sentibank.archive.load().dict("AFINN_v2011")
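As a rough illustration of how an AFINN-style dictionary might be applied, the sketch below sums entry scores over a text and checks two-word entries first so that bigrams such as “fed up” are matched as a unit. The function `score_text` and the toy lexicon are hypothetical, and the toy scores are illustrative rather than quoted from AFINN. In practice the mapping would come from the `dict("AFINN_v2011")` call mentioned above, assuming it returns a mapping from entry to score.

```python
import re

# Hypothetical toy lexicon in the AFINN style. The scores are illustrative
# and are not claimed to match the real AFINN entries.
TOY_LEXICON = {"good": 3, "bad": -3, "fed up": -3, "lol": 3}


def score_text(text, lexicon):
    """Sum lexicon scores over a text, preferring bigram matches."""
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    total = 0
    i = 0
    while i < len(tokens):
        # Try a two-word entry first so bigrams are not split into unigrams.
        if i + 1 < len(tokens):
            bigram = tokens[i] + " " + tokens[i + 1]
            if bigram in lexicon:
                total += lexicon[bigram]
                i += 2
                continue
        # Words missing from the lexicon contribute 0.
        total += lexicon.get(tokens[i], 0)
        i += 1
    return total


print(score_text("I am fed up with this bad service", TOY_LEXICON))
```

Summing is only one possible aggregation; averaging over matched entries is a common alternative when texts differ widely in length.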

📋 Introduction

The AFINN (Nielsen, 2011) sentiment lexicon was created to capture both common emotional words and informal internet slang. It provides manually annotated sentiment labels ranging from -5 to +5 for each entry. Though Harvard General Inquirer and OpinionFinder have larger word lists, the author notes AFINN’s granular sentiment scoring appears superior to simple binary polarity. This highlights the value of manual valence annotations even for informal terms.
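To make the contrast between graded valence and binary polarity concrete, the following sketch collapses signed scores to their sign. The helper `to_polarity` and the toy entries are assumptions for illustration only; the scores are not quoted from AFINN.

```python
def to_polarity(valence):
    """Collapse a signed valence score to -1, 0, or +1."""
    return (valence > 0) - (valence < 0)


# Hypothetical entries with illustrative scores on the -5..+5 scale.
toy_valence = {"nice": 2, "superb": 5, "annoying": -2, "catastrophic": -4}

# After collapsing, mild and strong words become indistinguishable.
toy_polarity = {word: to_polarity(v) for word, v in toy_valence.items()}
print(toy_polarity)
```

Under binary polarity, a mild and a strong positive word receive the same label, which is the information a graded scale retains.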

📚 Original Dictionary

ver.2009

The AFINN dictionary was initiated in 2009 using tweets related to the United Nations Climate Conference (COP15). Over 1,400 words were manually scored by the author based on observed usage on social media. The dictionary was then gradually expanded by examining social media posts and internet slang terms from public sources.

ver.2011

The 2011 update to the AFINN sentiment lexicon saw the addition of 997 new words and phrases, bringing the total to 2,477 entries. This expansion drew from several sources to cover a wider range of sentiment-conveying language.

The author started by adding a set of obscene and profane terms based on prior taboo word lists (Baudhuin, 1973; Sapolsky, Shafer, and Kaye, 2008, cited in Nielsen, 2011). Additional positive and negative emotional words came from the Original Balanced Affective Word List and The Compass DeRose Guide to Emotion Words. Synonyms of these entries were found using Wiktionary to broaden the lexical coverage. Ambiguous words like “surprise” were excluded to keep the focus on clear sentiments. Most positive terms were rated +2 and most negative terms -2, with strong obscenities scoring -4 or -5. Internet slang and acronyms were pulled from Urban Dictionary to capture modern informal language.

ver.2015

By 2015, the AFINN lexicon had grown to 3,382 total entries through further expansion of sources and coverage. This iterative approach, which drew on crowdsourced resources while keeping the scoring manual, allowed the sentiment dictionary to reflect real language usage across formal and informal domains.

from sentibank import archive

# Create the archive loader
load = archive.load()
# Load the original AFINN dictionary (2015 version)
afinn = load.origin("AFINN_v2015")

Table: AFINN (Nielsen, 2011). Columns: lexicon, score.

🧹 Processed Dictionary

No notable changes were made to the original CSV.