GitXplorerGitXplorer
s

people-occupations-classifier

public
5 stars
3 forks
1 issues

Commits

List of commits on branch master.
Unverified
9bc1d9a970aff844d337e55aebef4b2695507289

add pipeline script

ssdabhi23 committed 5 years ago
Unverified
e399c469da4b15464b5785a0bf66fa35c46fa97a

Merge branch 'master' of https://github.com/sdabhi23/people-occupations-classifier

ssdabhi23 committed 6 years ago
Unverified
8b7823cf435205497d96cc205e48815f20cf1706

add comparision (enhancement)

ssdabhi23 committed 6 years ago
Verified
c6c5854865d425a353c2a1ae35cfeb7ba71b5a99

Update README.md

ssdabhi23 committed 6 years ago
Unverified
63812657715b0e97fbb4df5125eae06ba0d92db6

completed

ssdabhi23 committed 6 years ago
Unverified
7947c300971c5fb354a042e1f9ded9c4f14cca6f

add some docs

ssdabhi23 committed 6 years ago

README

The README file for this repository.

people-occupations-classifier

This project has been undertaken as a part of our undergraduate coursework, to understand generation of datasets and classification algorithms.

Getting started

  1. Clone this repo

    $ git clone https://github.com/sdabhi23/people-occupations-classifier.git
  2. Setup virtual environment (optional)

    $ virtualenv .venv
    # for windows
    $ .venv\Scripts\activate
    # for *nix
    $ .venv/Scripts/activate
  3. Installing the required libraries

    $ pip install -r requirements.txt
  4. Additional steps for nltk

    >>> import nltk
    >>> nltk.download('stopwords')
  5. Steps to configure ipython kernel

    Required only if using virtual environment

    $ ipython kernel install --user --name=people_classifier

    Then change the kernel in the jupyter interface to people_classifier.

Maintainers

References

  • Classifying Wikipedia People Into Occupations by Aleksander Gabrovski (.pdf)
  • Multi-Class Text Classification with Scikit-Learn (article)
  • Using jupyter notebooks with a virtual environment (article)