Absolute Zero: Reinforced Self-Play Reasoning with Zero Data

1 point by mentalgear 1 month ago | 1 comment
  • mentalgear 1 month ago
    Linking to the official project page as it is an excellent visualisation of the ideas in the paper. Paper-link can be found in the header.