Human judgment,
on demand.
CogForce pays people to teach it taste. Small judgment calls — which reply is warmer, which translation lands the joke, which photo is on-brand — done from anywhere. Hidden probes grade you against the requester's intent. Your score follows you and compounds.
- 38k
- Active taskers
- 142
- Skill domains
- 0.91
- Inter-rater κ
“My mom is in the hospital and I don't know what to text my sister.”
- Atlas Models
- Northbridge AI
- Helix Labs
- Foundry Cognition
- Argo Research
- Lighthouse
- Sigma Studio
Three reasons this exists.
One for the labs that need the signal. One for the governments watching jobs change. One for the mechanism that makes the data trustworthy.
AI gets bland from blending.
Train on the whole internet and the model collapses to the average. The bigger the corpus, the safer and duller the output. RLHF isn't a phase — it's a permanent layer. Somebody human has to keep telling the model which version a person actually wanted.
Idle cognitive capacity is a national problem.
When AI absorbs routine cognitive work, displaced workers don't disappear — they idle. UBI is a transfer, not a job. Reskilling programs are slow. We need a third path: manufacture massive volumes of human-only work — judgment, taste, calibration — and pay people to do it.
You can't just click anything.
Two signals run on every task, both invisible. Internal consistency: do you agree with yourself on near-duplicate items spaced across sessions? Alignment: on hidden probes whose answer is held aside, do your picks match what the requester wanted?
Two signals you can't see.
Most preference work today is unaudited and unowned. We make it measurable without making it cold.
- 01
A task arrives
A small, well-scoped judgment call. Pick a warmer reply. Choose a more on-brand photo. Mark a refusal as right, hedged, or paranoid.
- 02
Two hidden signals run
Some items are probes — answers known, held aside. Others are near-duplicates of items you've already seen. You can't tell which is which.
- 03
Score = consistency × alignment
Probes measure whether your taste matches the requester. Duplicates measure whether you agree with yourself. Both, together, aren't gameable.
- 04
Score compounds forward
Calibration carries between sessions. Strong reviewers unlock harder, better-paid work. Your score is yours, signed and exportable.
A few tasks, right now.
- T-2041 hotWhich reply sounds like a human who cares?
A grieving customer just messaged a brand. Two AI replies. Pick the one that doesn't sound trained.
ToneAnywhere$0.38per item1m8,400 openTier 1 - T-2068Apology: genuine or PR?
Two corporate apology drafts after a service outage. Mark which would actually calm a furious customer.
ToneAnywhere$0.42per item1m4,220 openTier 1 - T-2107Idiomatic Korean → English
Pick the translation that lands the joke without explaining it. Native or near-native KR/EN.
TranslationAnywhere$0.75per item3m1,240 openTier 2 - T-2118Spanish → English: keep the warmth
A grandmother's voice note about a great-grandchild. Which translation keeps the affection?
TranslationAnywhere$0.70per item3m510 openTier 2 - T-2199On-brand or off?
Three product images. One is on-brand for a calm Scandinavian skincare label. Pick it.
AestheticAnywhere$0.60per item2m3,120 openTier 1
A workforce whose product is judgment, not output.
Factory labor moved matter. Office labor moved information. AI now does the routine cognition. What's left as valuable human work is taste — and we'd rather pay for it than ignore it. Linguists, lawyers, cooks, nurses, parents, retired editors: judgment is everywhere.
- Tasks small enough to do during a coffee break.
- Pay scales with calibration, not seniority theater.
- No camera-on, no surveillance dashboards. Just work.
- Your score belongs to you, and follows you.
“I do six tasks on the train home. Quiet work that pays attention to whether I'm good at it. I've never had a job that did that.”
Three audiences. One mechanism.
AI gets the signal it can't synthesize. Workers get paid for the taste they already have. Governments get a labor market for the era after routine cognition.