snakefusion

Commits

List of commits on branch main.
94f41adba17f1b1f13580404759e072d56f26e43

Support setting a batch size for analogy/similarity queries

ddanieldk committed 3 years ago
89f1f40ec899ed38aa8e5cc546ce8441499ce580

Bump version to 0.1.6

ddanieldk committed 3 years ago
da24c22900c2852939f150d58427073a3b5e62c2

Accept pathlib paths in all constructors

ddanieldk committed 3 years ago
2126aa0faaa228267ca7e17d5b8590ef7d4ebff5

CI: cache Rust build artifacts

ddanieldk committed 3 years ago
31b44b8d31f9fbd323aab3a7e41cf13270add682

Build wheel on Python 3.6, test on all recent versions

ddanieldk committed 3 years ago
435a15af1a99c2a0d751a74b63769f3035693b30

Bump version to 0.1.5

ddanieldk committed 3 years ago

README

The README file for this repository.

🐍 snakefusion

Introduction

snakefusion is a Python package for reading, writing, and using finalfusion, fastText, floret, GloVe, and word2vec embeddings. This package is a thin wrapper around the Rust finalfusion crate.

snakefusion supports the same types of embeddings as finalfusion:

  • Vocabulary:
    • No subwords
    • Subwords
  • Embedding matrix:
    • Array
    • Memory-mapped
    • Quantized
  • Format:
    • fastText
    • finalfusion
    • floret
    • GloVe
    • word2vec
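
To illustrate the "Subwords" vocabulary type: subword embeddings in the fastText family represent a word by its character n-grams, so a word that is missing from the vocabulary can still be assigned a vector built from its n-grams. The following is a minimal sketch of that idea in plain Python. It is not snakefusion's or finalfusion's implementation; the function name `char_ngrams` and its parameters are hypothetical and chosen for illustration only, and the default n-gram lengths of 3 to 6 are an assumption.

```python
def char_ngrams(word, min_n=3, max_n=6):
    """Return the character n-grams of a word wrapped in boundary markers.

    Illustrative helper only (hypothetical name, not part of snakefusion).
    """
    # Subword models of this kind mark word boundaries with '<' and '>',
    # so that prefixes and suffixes get n-grams distinct from word-internal ones.
    bracketed = f"<{word}>"
    ngrams = []
    for n in range(min_n, max_n + 1):
        # Slide a window of width n over the bracketed word.
        for start in range(len(bracketed) - n + 1):
            ngrams.append(bracketed[start:start + n])
    return ngrams


if __name__ == "__main__":
    print(char_ngrams("cat", min_n=3, max_n=4))
```

In a subword vocabulary, each such n-gram is mapped to a row of the embedding matrix, and the vector of an unknown word is derived from the rows of its n-grams. How the n-grams are indexed differs between formats and is not shown here.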

Building from source

Building snakefusion from source requires two things: a Rust toolchain installed through rustup, and the setuptools-rust package. With the toolchain in place, install setuptools-rust:

$ pip install --upgrade setuptools-rust

You can then build and install snakefusion in your environment:

$ pip install .

Documentation

Read the snakefusion documentation for a quickstart and API reference.

You can use finalfrontier to train new embeddings, or download pretrained embeddings.