GitXplorerGitXplorer
t

anti-spoiler-qa

public
0 stars
0 forks
0 issues

Commits

List of commits on branch main.
Unverified
bfffe472da5e57b58356ed0cc05ae1f127a6f372

feat: initial implementation to parse a single epub

ttomsanbear committed a year ago
Verified
0de5ad68b91184de0d70f2db1320fbb306f7e180

Initial commit

ttomsanbear committed a year ago

README

The README file for this repository.

Anti Spoiler QA

A quick demo of a RAG pipeline that answers questions about a book, using excerpts from the book as it's only source of knowledge. Uses Cohere as the underlying LLM and Embedding Vector provider.

Quickstart

I used Romeo and Juliet from Project Gutenberg as an example epub, you can find and download that here: https://www.gutenberg.org/ebooks/1513.epub.noimages

  1. Install pyenv (brew install pyenv on OSX)
  2. Run ./deps.sh
  3. Create a new file named .env and add a variable with your Cohere API key COHERE_API_KEY
  4. Run python main.py and follow the prompts!

Limitations

This was a pretty quick and rudimentary take at building this pipeline, I would like to add better metdata and chapter name parsing as it's really dependent on the name of the internal HTML files right now