Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Chen, X-W & Gabrenya, W. K. Jr. (2023). Is there really any good way to measure cultural intelligence, and what exactly is it, anyway? In Thomas, D. & Liao, Y. (Eds.) Handbook of Cultural Intelligence ...