O'Reilly logo
  • clancy birrell thinks this is interesting:

Both lexicons have more negative than positive words, but the ratio of negative to positive words is higher in the Bing lexicon than the NRC lexicon. This will contribute to the effect we see in the plot above, as will any systematic difference in word matches, for example, if the negative words in the NRC lexicon do not match very well with the words that Jane Austen uses. Whatever the source of these differences, we see similar relative trajectories across the narrative arc, with similar changes in slope, but marked differences in absolute sentiment from lexicon to lexicon. This is important context to keep in mind when choosing a sentiment lexicon for analysis


Cover of Text Mining with R


The text here ignores the fact that the NRC sentiment lexicon has a multiple class structure and is more nuanced in it's classification rather than a simple positive/negative in the other lexicons.