Skip to content
#

preference-optimization

Here are 26 public repositories matching this topic...

LLM_InSight

This my home rig testing process for creating evaluation metric, testing models, automating prompt creation in accordance to the evaluation results of last run and reviewing logs. its local first, independent of any specific tool and logs locally.

  • Updated Jun 15, 2026
  • Python

Open-source research engineering project for building the end-to-end post-training stack for reasoning language models, including SFT, preference learning, RLHF/RLVR, evaluation, inference-time scaling, and scalable systems for frontier-level reasoning.

  • Updated Jun 21, 2026
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the preference-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the preference-optimization topic, visit your repo's landing page and select "manage topics."

Learn more