What if your HNSW index stored 3-bit embeddings instead of float32? [R]
I've been experimenting with an approach to vector indexing where the HNSW graph nodes store quantized embeddings (~388 bytes each at dim=1024) instead of float32 vectors (~4,096 bytes). The key insight: if you quantize embeddings using Lloyd-Max s…