rom1504
Interested in machine learning (computer vision, natural language processing, deep learning), Node.js (networking, bots, web), and programming in general.
Repositories
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
clip-retrieval
Easily compute CLIP embeddings and build a CLIP retrieval system with them
cc2dataset
Easily convert Common Crawl to a dataset of captions and documents: image/text, audio/text, video/text, ...
laion-prepro
Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them
image_embeddings
Uses EfficientNet to provide embeddings for retrieval
awesome-semantic-search
Semantic search with embeddings: index anything
MinecraftChat
Web-based Minecraft chat client
embedding-reader
Efficiently read embeddings as a stream from any filesystem
rbot
Bot made with mineflayer that can perform tasks
gpu-tester
gpu-tester detects broken and slow GPUs in a cluster
dalle-service
Dalle service
any2dataset
Turn any collection of files into a dataset