s1

    Matching o1-preview with Only 1000 Examples

    Featured
    2 Votes
    s1 media 1
    s1 media 2
    s1 media 3
    s1 media 4
    s1 media 5

    Description

    s1 is a simple recipe for test-time scaling of LLMs, achieving strong reasoning performance comparable to o1-preview using only 1,000 examples & budget forcing. Open-source model, data, and code available.

    Recommended Products