GitXplorerGitXplorer
T

srt2vocab

public
0 stars
0 forks
0 issues

Commits

List of commits on branch main.
Unverified
6d43ecb2c1b45a86ba1dca8c825f23e755fabbb1

init

committed 2 years ago

README

The README file for this repository.

SRT2vocabulary

Script for converting subtitles in SRT format to a wordlist.

The purpose of this is to get new vocabulary to learn from series I watch.

The idea is to get a small list of uncommon words that can be redacted manually and used for learning the unknown words in a tool like vocabulary.com.

Vocabulary.com screenshot

Example Usage

src/main.py samples/The.Witcher.S02E01.srt common_words/30k.txt samples/witcher_words.txt

Most common English words lists from https://github.com/derekchuank/high-frequency-vocabulary