Text analysis dictionary on psychology

My automated dictionary creation project is making good progress: I have created a psychology dictionary from a large corpus of UK news articles on psychology published from 1990 to 2011. The score given to each entry word is interpreted as its strength of association with psychology, and the list can be truncated based on the scores.

The words are extracted using a technique that I call collocation-of-collocation. In this technique, a pattern ‘psycholog*’ that matches ‘psychology’, ‘psychologist’, ‘psychological’, and ‘psychologically’ is given to the system, which finds the collocations of those words. Those collocations are then used in turn to extract words that are semantically close to psychology. The technique is meant to overcome a limitation of collocation analysis in synonym extraction: words that have the same meaning do not co-occur, so they never show up as direct collocations of each other.
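
To make the two passes concrete, here is a minimal Python sketch of the idea. It is not the actual system: the function names, the stopword list, the window size, the `top_k` cut-off and the use of raw co-occurrence counts as scores are all assumptions made for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not the real one).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "on", "for", "with", "that"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def collocates(docs, is_seed, window=3):
    """Count non-seed, non-stopword tokens within `window` tokens of a seed."""
    counts = Counter()
    for doc in docs:
        tokens = tokenize(doc)
        for i, tok in enumerate(tokens):
            if not is_seed(tok):
                continue
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                neighbor = tokens[j]
                if j != i and neighbor not in STOPWORDS and not is_seed(neighbor):
                    counts[neighbor] += 1
    return counts


def collocation_of_collocation(docs, pattern=r"psycholog\w*", top_k=5, window=3):
    """Two-pass extraction: collocates of the pattern, then collocates of those."""
    seed_re = re.compile(pattern)
    # Pass 1: words that co-occur with 'psychology', 'psychologist', etc.
    first = collocates(docs, lambda t: seed_re.fullmatch(t) is not None, window)
    first_order = {word for word, _ in first.most_common(top_k)}
    # Pass 2: words that co-occur with the first-order collocates.
    second = collocates(docs, lambda t: t in first_order, window)
    # Drop the original pattern words so that only new candidates remain.
    for word in [w for w in second if seed_re.fullmatch(w)]:
        del second[word]
    return first_order, second
```

Calling `collocation_of_collocation(docs, top_k=1)` on a handful of sentences returns the first-order collocates and a `Counter` of second-order candidates. The point of the second pass is that a word can enter the candidate list even if it never appears near a ‘psycholog*’ word, as long as it shares collocates with one.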