nlpTools - Natural Language Processing Toolkit for PHP

Text Categorisation and Topic/Domain Identification

Identifies the semantic field of a given text and assigns the text to the corresponding topic or domain.

To build such a system, a text classification technique has to be adapted to a given set of application domains. In this demo, the Reuters Transcribed Subset is used to train the classifier, and the learnt model is then applied to predict the topic of the most-read articles from Reuters.
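
The train-then-predict workflow described above can be illustrated with a minimal sketch. It assumes a multinomial Naive Bayes classifier with add-one smoothing, which is one common text classification technique and not necessarily the one nlpTools uses. The sketch is written in Python for brevity and does not show the nlpTools PHP API. The class name, the function names, and the four training sentences are hypothetical stand-ins for the real Reuters data.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTopicClassifier:
    """Hypothetical multinomial Naive Bayes classifier (illustration only)."""

    def __init__(self):
        self.doc_counts = Counter()               # documents seen per topic
        self.word_counts = defaultdict(Counter)   # word frequencies per topic
        self.vocabulary = set()                   # every word seen in training

    def train(self, labelled_docs):
        """Count topics and words from (text, topic) pairs."""
        for text, topic in labelled_docs:
            self.doc_counts[topic] += 1
            for word in tokenize(text):
                self.word_counts[topic][word] += 1
                self.vocabulary.add(word)

    def predict(self, text):
        """Return the topic with the highest log-probability for the text."""
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        words = tokenize(text)
        best_topic, best_score = None, -math.inf
        for topic, n_docs in self.doc_counts.items():
            # Log prior: share of training documents carrying this topic.
            score = math.log(n_docs / total_docs)
            topic_total = sum(self.word_counts[topic].values())
            for word in words:
                # Add-one smoothing keeps unseen words from zeroing the score.
                count = self.word_counts[topic][word]
                score += math.log((count + 1) / (topic_total + vocab_size))
            if score > best_score:
                best_topic, best_score = topic, score
        return best_topic


# Made-up training sentences standing in for a labelled news corpus.
training_data = [
    ("Oil prices rose as crude supply tightened", "energy"),
    ("The central bank raised interest rates again", "finance"),
    ("Crude oil output fell at the refinery", "energy"),
    ("Banks reported higher interest income this quarter", "finance"),
]

classifier = NaiveBayesTopicClassifier()
classifier.train(training_data)
predicted_topic = classifier.predict("Refinery output and crude prices")
```

With this toy data, the headline "Refinery output and crude prices" is assigned to the "energy" topic, because its words occur only in the energy training sentences. A real system would train on a much larger labelled corpus and would usually add steps such as stop-word removal.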