GitXplorerGitXplorer
s

mg-ngram-repetitions

public
0 stars
1 forks
0 issues

Commits

List of commits on branch master.
Unverified
9a1c11f959b18ece955b16e92028f31519a1e4fd

Remove 'salvaged' module.

ssouthgate committed 11 years ago
Unverified
4cc2d569eccda2fd2888557c62ae4f45c01e422c

Update README.md

committed 11 years ago
Unverified
a259be485f8e65675b9e32f58d8c0f5714be507e

Add the README forealzies

ssouthgate committed 11 years ago
Unverified
9b5443868c67909712ccad00490f360df1b40ecc

Add a README and remove old versions.

ssouthgate committed 11 years ago
Unverified
fc0dc03e727e816bbd033e3bcb697ac36c5b80b9

Add tests and refactor repeitions module.

ssouthgate committed 11 years ago
Unverified
a92c7c389226c0ea449a58cded7be8e241ac808a

Add gitignore

ssouthgate committed 11 years ago

README

The README file for this repository.

N-Gram Repetition Analysis

Python module to analyze repetition of n-grams of up 15 words in length.

Running

Move the Corpus directory into this directory and run the following:

python repetition.py

This will create CSV results in the output directory. Global results will live in output/Corpus while results specific to each work will live in output/Corpus/works.

Running the Tests

There are some rudimentary tests that just make sure the reptition module runs. It doesn't contain any validation of the output YET.

The fixtures directory contains test data.

To run the tests:

python repetition_tests.py