We propose a practical acquisition function for prompt/completion pairs based on the predictive entropy of the language model and a measure of certainty of the implicit preference model optimized by DPO.
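As a rough illustration of the two ingredients named above, the sketch below computes a Monte Carlo estimate of predictive entropy from sampled completions and a certainty score from DPO's implicit reward, r(x, y) = β · (log π_θ(y|x) − log π_ref(y|x)). The function names, the `beta` default, and in particular the additive `acquisition_score` combination are assumptions made for illustration; they are not taken from the source, which does not specify how the two quantities are combined.

```python
import math
from typing import Sequence


def predictive_entropy(sample_logprobs: Sequence[float]) -> float:
    """Monte Carlo entropy estimate for one prompt.

    `sample_logprobs` holds log p(y_i | x) for completions y_i sampled
    from the model itself; the estimate is the negative mean.
    """
    if not sample_logprobs:
        raise ValueError("need at least one sampled completion")
    return -sum(sample_logprobs) / len(sample_logprobs)


def implicit_reward(logp_policy: float, logp_ref: float, beta: float = 0.1) -> float:
    """DPO implicit reward: beta * (log pi_theta(y|x) - log pi_ref(y|x))."""
    return beta * (logp_policy - logp_ref)


def preference_certainty(reward_a: float, reward_b: float) -> float:
    """Certainty of the implicit preference model about a pair, in [0, 1).

    Under a Bradley-Terry model, P(a preferred over b) = sigmoid(r_a - r_b).
    We report |2P - 1|, which equals tanh(|r_a - r_b| / 2): 0 means the
    model is indifferent, values near 1 mean it is very sure.
    """
    return math.tanh(abs(reward_a - reward_b) / 2.0)


def acquisition_score(entropy: float, certainty: float, weight: float = 1.0) -> float:
    """Hypothetical combination (an assumption, not the source's rule)."""
    return entropy + weight * certainty


# Toy usage with made-up log-probabilities for one prompt and one pair.
entropy = predictive_entropy([-12.0, -15.0, -9.0])
r_a = implicit_reward(logp_policy=-10.0, logp_ref=-14.0)
r_b = implicit_reward(logp_policy=-13.0, logp_ref=-12.0)
score = acquisition_score(entropy, preference_certainty(r_a, r_b))
```

In practice the log-probabilities would come from summing token-level log-probs of each completion under the policy and reference models; the numbers here are placeholders only.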
Vectify AI is developed by researchers and professors from UCL and Oxford. Our products are powered by cutting-edge AI research.