Production-ready test-time compute optimization framework for LLM inference. Implements Best-of-N, Sequential Revision, and Beam Search strategies. Validated with models up to 7B parameters.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results