Immigration dictionary

Standard

This is probably the final version of my immigration dictionary. This text analysis dictionary was created using technique called the Latent Semantic Scaling, which is based on the Latent Semantic Analysis, from British newspaper corpus.

The result of the automated content analysis by this dictionary is strongly corresponds to manual coding by Amazon’s Mechanical Turks as you can see in the chart (whiskers represent 95% confidence intervals). Yet, please note that the documents coded by the dictionary are only sentences about immigration in the party manifestos selected by keywords (‘immigra*’, ‘migra*’, ‘refugee*’, ‘asylum*’, ‘foreign*’).

UK 2010 manifestos on immigration

The dictionary is made up of 750 entry words. The following is the top 30 most positive and negative words in the dictionary. Many of them are intuitively positive or negative, but some are not. For example, ‘globalisation’ is positive only in the context of immigration. This is why texts are restricted to sentences on this subject. We can spot words like ‘species’ and ‘wildebeest’, because the newspaper corpus contains stories about animal migration, but it is not too harmful.

# Positive words

1   skills            100
2   globalisation     88.24
3   chauffeured       86.93
4   airport           86.68
5   ranging           82.41
6   clearance         79.48
7   status            78.4
8   agency            74.98
9   issues            72.15
10  breed             69.45
11  claimed           68.84
12  vehemently        68.6
13  skill             67.3
14  test              65.91
15  attract           64.39
16  permanent         63.68
17  legal             59.23
18  melting-pot       57.34
19  species           57.27
20  wildebeest        56.96
21  overstaying       56.07
22  documents         55.9
23  routes            55.75
24  work              55.63
25  shambles          55.28
26  breeding          53.65
27  bringing          53.24
28  employ            52.76
29  passport          52.24
30  official          51.88


# Negative words

1   xenophobia        -141.27
2   control           -130.09
3   racist            -125.2
4   stemming          -122.5
5   tide              -122.46
6   working-class     -115.53
7   negative          -113.76
8   failure           -110.32
9   problems          -106.95
10  influx            -100.81
11  branded           -99.42
12  caused            -96.82
13  exploit           -94.11
14  first-generation  -90.78
15  warned            -89.93
16  families          -88.51
17  soaring           -86.53
18  ignored           -86.45
19  housed            -85.33
20  magnet            -84.47
21  borders           -83.18
22  newly-arrived     -83.12
23  accused           -82.89
24  evicted           -82.02
25  trickle           -81.42
26  rates             -79.42
27  fuelled           -78.34
28  flooded           -76.69
29  non-white         -76.48
30  lorries           -76.38

Leave a Reply

Your email address will not be published. Required fields are marked *