youtrack-issue-classifier

Commits

List of commits on branch master.
733a1504e0d8665786100c0bbe17c321dc370d75

Include stack trace

ilinum committed 9 years ago

f782bb001a6702bc6a2fe497ce0aa7681fd77d20

Update mllib version

ilinum committed 9 years ago

74f5f6c037f8c5e6d8c8617b6942a60fdbe9cd3b

Added author

jamesbrain committed 9 years ago

e6c337808b9208b8ec368f3608441e2ead8e2932

Cleanup (Mostly nGram code, which is now generalized)

ilinum committed 9 years ago

86771aa672e13bdd4d21f1a4f782b59e206abcf5

Added n-grams and stemmer

jamesbrain committed 9 years ago

644171200f13befa6784742fc7026006434e7c5e

Update README.md

ilinum committed 9 years ago

README

Youtrack Issue Classifier

Youtrack Issue Classifier is a machine learning algorithm that tries to determine the subsystem of a ticket based on its context. It uses Apache Spark and its MLlib library.

The algorithm uses a Naive Bayes classifier to learn how old tickets were divided into subsystems on YouTrack, then tries to guess which subsystem new tickets should be assigned to.
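To make the idea concrete, here is a minimal sketch of Naive Bayes text classification in plain Python. It is not the project's code (the project relies on Spark MLlib); the function names, the whitespace tokenizer, the add-one smoothing, and the sample tickets are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()


def train(tickets):
    """Count words per subsystem from (text, subsystem) pairs."""
    doc_counts = Counter()               # tickets seen per subsystem
    word_counts = defaultdict(Counter)   # word frequencies per subsystem
    vocabulary = set()
    for text, subsystem in tickets:
        doc_counts[subsystem] += 1
        for word in tokenize(text):
            word_counts[subsystem][word] += 1
            vocabulary.add(word)
    return doc_counts, word_counts, vocabulary


def predict(model, text):
    """Return the subsystem with the highest log posterior for the text."""
    doc_counts, word_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for subsystem, n_docs in doc_counts.items():
        # Log prior: fraction of old tickets filed under this subsystem.
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[subsystem].values())
        for word in tokenize(text):
            if word not in vocabulary:
                continue  # ignore words never seen during training
            # Laplace (add-one) smoothing avoids zero probabilities.
            count = word_counts[subsystem][word] + 1
            score += math.log(count / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = subsystem, score
    return best_label


# Hypothetical training data: (ticket text, subsystem).
old_tickets = [
    ("editor freezes while typing", "Editor"),
    ("typing lag in the editor window", "Editor"),
    ("debugger skips breakpoint", "Debugger"),
    ("breakpoint ignored by debugger on step", "Debugger"),
]
model = train(old_tickets)
print(predict(model, "editor lag when typing"))
```

Working in log space keeps the product of many small probabilities from underflowing, and the smoothing term lets a subsystem still be scored when a word appeared only under other subsystems.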

# How to run

## Terminal

To run, use the following command:

$SPARK_HOME/bin/spark-submit --class "me.ilinskiy.issueClassifier.YoutrackClassifier" --master local[4] $OUTPUT_JAR

## IntelliJ IDEA

All you need to do is set VM options to: `-Dspark.master=local[4]`

Then you can run the main method normally.

# Misc

Issues are downloaded into the issues directory if that directory is not present. On later runs they are reused without being downloaded again.
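The download-once behaviour can be sketched as follows. This is an illustrative Python version, not the project's implementation: `load_issues`, `fake_download`, and the file name `issue-1.txt` are hypothetical, and the real download from YouTrack is replaced by a stand-in that writes one file.

```python
import tempfile
from pathlib import Path


def load_issues(issues_dir, download):
    """Return cached issue files, calling download() only if the directory is absent."""
    issues_dir = Path(issues_dir)
    if not issues_dir.exists():
        issues_dir.mkdir(parents=True)
        download(issues_dir)  # first run: populate the cache
    return sorted(issues_dir.iterdir())


calls = []


def fake_download(target_dir):
    """Hypothetical stand-in for the real YouTrack download."""
    calls.append(target_dir)
    (target_dir / "issue-1.txt").write_text("Editor freezes while typing")


with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "issues"
    first = load_issues(cache, fake_download)    # directory missing: downloads
    second = load_issues(cache, fake_download)   # directory present: reuses it
    print(len(calls), first == second)
```

Because the check is only on whether the directory exists, an empty or partially filled directory would presumably be treated as a complete cache; deleting the directory is the simplest way to force a fresh download.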