Hi!
I have 2 collections, both with Search Indexes. Documents in both collections contain an OpenAI embedding with 1536 values. The first collection has 10k documents, the second one has 100k documents.
Cluster specs:
- Frankfurt
- M10 (General)
- 3 replica nodes
- no search nodes
I have 2 issues:
- queries have a very variable response times - sometimes it’s as little as 0.5 sec, sometimes it can even reach 4 seconds (especially in the case of the 100k docs collection)
- it seems like the response times expierience “cold start” - first queries take the longest, the subsequent ones are pretty ok
Can I do something about it? Can it be sped up? I really would like to have a consistent 0.5-1.0sec repsonse times.