Practical Text Mining with Perl

Roger Bilisoly


Provides readers with the methods, algorithms, and means to perform text mining tasks

This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available on the web (www.perl.org). It covers mining ideas from several perspectives (statistics, data mining, linguistics, and information retrieval) and gives readers the means to complete text mining tasks on their own.

The book begins with an introduction to regular expressions, a text pattern methodology, and quantitative text summaries, all of which are fundamental tools for analyzing text. It then builds on this foundation to explore:

  • Probability and texts, including the bag-of-words model
  • Information retrieval techniques such as the TF-IDF similarity measure
  • Concordance lines and corpus linguistics
  • Multivariate techniques such as correlation, principal components analysis, and clustering
  • Perl modules, German, and permutation tests

Each chapter is devoted to a single key topic, and the author carefully introduces mathematical concepts as they arise, allowing readers to learn as they go without having to consult additional books. Numerous exercises and worked-out examples further complement the book's student-friendly format.

Practical Text Mining with Perl is ideal as a textbook for undergraduate and graduate courses in text mining and as a reference for a variety of professionals who are interested in extracting information from text documents.
