EricZ's repos on GitHub
Python · 2586 人关注
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Go · 32 人关注
go-fasttext
Facebook fastText database in SQLite with Go API
Go · 27 人关注
go-sql-lsh
Locality Sensitive Hashing using Golang and SQL database
Go · 10 人关注
go-datasketch
Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)
Go · 9 人关注
datatable
An in-memory relational table in Go similar to C#'s System.Data.DataTable.
Go · 4 人关注
counter
A frequency counter similar to Python's collections.Counter with additional support of other statistics.
Python · 2 人关注
automl-gs
Provide an input CSV and a target field to predict, generate a model + code to run it.
Jupyter Notebook · 1 人关注
FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
1 人关注
garnet
Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication features. Garnet can work with existing Redis clients.
Go · 1 人关注
go-minhash
BottomK minwise hashing for streaming set similarity
Jupyter Notebook · 0 人关注
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Jupyter Notebook · 0 人关注
big-ann-benchmarks
Framework for evaluating ANNS algorithms on billion scale datasets.
Java · 0 人关注
bigdata-interop
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
JavaScript · 0 人关注
binaryworm
A small web game inspired by a puzzle.
Go · 0 人关注
binsort
Binsort is a tool to sort files of fixed-length binary records
C · 0 人关注
bitarray
efficient arrays of booleans for Python
Python · 0 人关注
ckanapi
A command line interface and Python module for accessing the CKAN Action API
TeX · 0 人关注
csc373ta
Tutorial materials for CSC373
Rust · 0 人关注
differential-dataflow
An implementation of differential dataflow using timely dataflow on Rust.
0 人关注
DiskANN
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
0 人关注
gitignore
A collection of useful .gitignore templates
Go · 0 人关注
go-mysql-server
An extensible MySQL server implementation in Go.