Sgd + Number Of Doc In Ss

using gradient descent to find the optimal number of similarity search such that:

we guarantee a 100% hit rate for ground truth datasets (needs human or llm curation)
we guarantee a quality lower bound for ground

Gaia Prime