Included stop words
[1]:
from stemgraphic.stopwords import EN, FR, ES, ALT_EN
ALT_EN is a very short list of English stop words.
[2]:
len(ALT_EN)
[2]:
27
[3]:
print(ALT_EN)
['a', 'am', 'an', 'and', 'are', 'as', 'at', 'been', 'for', 'from', 'in', 'is', 'of', 'on', 'or', 'out', 'so', 'such', 'that', 'the', 'these', 'this', 'those', 'to', 'upon', 'was', 'were']
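A list like this is typically used to drop common words before counting or plotting. The following is a minimal sketch of that idea, not stemgraphic's own filtering code: `remove_stop_words` and the sample sentence are hypothetical names introduced for illustration, and the list is copied from the output above so the cell runs without importing anything.

```python
# Contents of ALT_EN copied from the printed output above,
# so this sketch runs without stemgraphic installed.
ALT_EN = ['a', 'am', 'an', 'and', 'are', 'as', 'at', 'been', 'for', 'from',
          'in', 'is', 'of', 'on', 'or', 'out', 'so', 'such', 'that', 'the',
          'these', 'this', 'those', 'to', 'upon', 'was', 'were']


def remove_stop_words(text, stop_words):
    """Return the lower-cased words of `text` that are not stop words.

    Hypothetical helper: splits on whitespace only, no punctuation handling.
    """
    stop = set(stop_words)  # a set makes each membership test cheap
    return [word for word in text.lower().split() if word not in stop]


sample = "The quick brown fox was out on a walk in the park"
filtered = remove_stop_words(sample, ALT_EN)
print(filtered)
```

With the real import in place, the same helper can be passed `EN`, `FR` or `ES` instead of `ALT_EN`.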
The French and Spanish stop word lists are quite similar, but Spanish has several gender-specific forms (e.g. French quelque vs. Spanish algun, algunos, algunas), so the Spanish list is larger.
[4]:
len(FR)
[4]:
127
[5]:
len(ES)
[5]:
183
The main English stop word list, EN, is significantly larger than any of the others.
[6]:
len(EN)
[6]:
316
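To see which words account for the size difference between two lists, a set comparison is usually enough. The sketch below uses a hypothetical helper, `compare_stop_lists`, and two small stand-in lists that are NOT the real stemgraphic contents. It only illustrates the approach.

```python
def compare_stop_lists(small, large):
    """Report which words two stop word lists share and which are unique.

    Hypothetical helper for illustration; not part of stemgraphic.
    """
    small_set, large_set = set(small), set(large)
    return {
        "shared": sorted(small_set & large_set),
        "only_in_small": sorted(small_set - large_set),
        "only_in_large": sorted(large_set - small_set),
    }


# Stand-in lists for demonstration only, not the actual ALT_EN / EN contents.
short_list = ['a', 'an', 'the', 'upon']
long_list = ['a', 'an', 'the', 'he', 'she', 'they']

report = compare_stop_lists(short_list, long_list)
print(report)
```

In the notebook itself, `compare_stop_lists(ALT_EN, EN)` would run the same comparison on the imported lists.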