This document describes the human data process we follow at micro1 to produce high-quality data for post-training RLHF.

The goal of this document is twofold:

  1. Describe in detail how micro1 generates the highest-quality RLHF data on Earth

  2. Give human data leaders and research teams a guide to the key items to look for when hiring a data vendor

We will cover our process for sourcing, vetting, and placing top global data annotators, as well as how we ensure that this talent produces the highest-quality data.

Parts of our RLHF platform can be called through our API, and we have included the endpoints for those. Our fully managed human data services can only be accessed after a demo, which can be scheduled here.