[Wikipedia cohere 1024 dimensional dataset](https://huggingface.co/datasets/CohereLabs/wikipedia-2023-11-embed-multilingual-v3) - [x] Ingest 250M records - [ ] Conduct load test on full-resolution vectors - [x] QPS - [x] p99 latency - [ ] recall @ k - [ ] Conduct same load test with following quantization options - [ ] Binary Quantization - [ ] Scalar Quantization - [ ] Binary two-bit - [ ] Binary 1.5-bit - [ ] TurboQuant (new feature) - [ ] Run same benchmarks with varying m and ef_construct parameters
Wikipedia cohere 1024 dimensional dataset