CogForce
Mechanics

How a guess becomes a graded judgment.

Most preference data is collected once, weighted equally, and never audited. CogForce treats reviewer judgment as a measurable thing — with all the nuance that “measurable” should imply.

01 · Tasks are small judgment calls

A task is a single decision: pick A or B, score warmth on a five-step scale, mark a refusal as correct, hedged, or paranoid. Each item is standalone. No tasker carries cognitive load between items.

Tasks are domain-tagged (legal, cooking, code review, Korean translation, pediatric tone). Routing is by demonstrated calibration in that domain, not self-reported expertise.
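
A minimal sketch of what a task record and the routing check might look like, with hypothetical field names throughout (nothing here is CogForce's published schema):

    from dataclasses import dataclass
    from typing import Literal

    @dataclass(frozen=True)
    class Task:
        """One standalone judgment call. All field names are illustrative."""
        task_id: str
        domain: str                          # e.g. "code_review", "korean_translation"
        kind: Literal["pairwise", "scale", "label"]
        payload: dict                        # the two replies, the text to score, etc.
        options: tuple[str, ...]             # ("A", "B") or ("correct", "hedged", "paranoid")

    def eligible(calibration: dict[str, float], task: Task, floor: float = 0.7) -> bool:
        # Route on demonstrated calibration in the task's domain, never self-report.
        return calibration.get(task.domain, 0.0) >= floor

Routing reads only the per-domain calibration map, so a strong score in one domain never opens another.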

02 · Some items are probes

A probe is an item where expert consensus is already known and held aside. Probes look identical to non-probes. Taskers cannot tell which is which, and the share rotates over time so memorization fails.

Probe rates by domain
  • Tone & warmth: 11–14%
  • Translation: 18–22%
  • Code review: 9–12%
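
A sketch of how a batch might be assembled at those rates, using midpoints of the published ranges and hypothetical names; the real rotation policy is not specified here:

    import random

    # Midpoints of the published ranges, purely for illustration.
    PROBE_SHARE = {"tone_warmth": 0.125, "translation": 0.20, "code_review": 0.105}

    def build_batch(live_items: list, probe_pool: list, domain: str,
                    rng: random.Random) -> list:
        """Interleave probes with live items so the two are indistinguishable."""
        share = PROBE_SHARE[domain]
        n_probes = round(len(live_items) * share / (1 - share))
        batch = live_items + rng.sample(probe_pool, n_probes)
        rng.shuffle(batch)                   # probes land in random positions
        return batch

Because the share itself rotates over time, memorizing which items were probes in one session buys nothing in the next.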

03 · Accuracy becomes weight

A tasker's accuracy on probes — corrected for item difficulty and reviewer disagreement — sets the weight of their judgment on unprobed items. High calibration means a tasker's call counts more in the consensus signal that goes back to the model.

Calibration is per-domain. A great editorial judge is not automatically a great code reviewer.
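
One plausible reading of that weighting step, sketched under assumptions: each probe carries credit proportional to its difficulty (the miss rate among calibrated reviewers), and the resulting per-domain score scales a tasker's vote in consensus. The correction for reviewer disagreement is omitted for brevity.

    from collections import defaultdict

    def calibration_score(probe_results, difficulty):
        """probe_results: list of (item_id, correct: bool).
        difficulty: item_id -> miss rate in [0, 1).
        Harder probes earn more credit when answered correctly."""
        credit, possible = 0.0, 0.0
        for item_id, correct in probe_results:
            d = difficulty[item_id]
            credit += d if correct else 0.0  # full difficulty credit for a correct call
            possible += d
        return credit / possible if possible else 0.0

    def weighted_consensus(votes):
        """votes: list of (choice, weight). Returns the calibration-weighted winner."""
        totals = defaultdict(float)
        for choice, weight in votes:
            totals[choice] += weight
        return max(totals, key=totals.get)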

04 · Score compounds forward

Calibration carries between sessions. Strong taskers unlock harder, better-paid work and start training the next layer. The unlock ladder is visible: every tasker can see what they need to do to move up a tier.

Score is portable. If you leave, your calibration history goes with you, signed and exportable.
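
A minimal sketch of a signed export, assuming an HMAC-SHA256 scheme purely for illustration; the actual signature format is not stated in this document:

    import hashlib, hmac, json

    def export_history(history: dict, signing_key: bytes) -> str:
        """Serialize a tasker's per-domain calibration history and attach a
        signature so a third party holding the key can detect tampering."""
        body = json.dumps(history, sort_keys=True).encode()
        sig = hmac.new(signing_key, body, hashlib.sha256).hexdigest()
        return json.dumps({"history": history, "sig": sig})

    def verify_export(blob: str, signing_key: bytes) -> bool:
        doc = json.loads(blob)
        body = json.dumps(doc["history"], sort_keys=True).encode()
        expected = hmac.new(signing_key, body, hashlib.sha256).hexdigest()
        return hmac.compare_digest(expected, doc["sig"])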

Common questions

What kind of work does CogForce route?
Small, well-scoped human judgment calls — picking the warmer of two AI replies, choosing the more on-brand microcopy, marking whether an AI's refusal was right, hedged, or paranoid. Each task is a single decision; no cognitive load is carried between items.
How does CogForce grade a tasker without knowing the right answer?
Two invisible signals run on every task. Probes are items where expert consensus is already known and held aside; near-duplicate items spaced across sessions measure whether a tasker agrees with themselves. Probes look identical to non-probes, the share rotates, and neither signal alone is gameable.
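The second signal, agreement with yourself across near-duplicates, could be computed like this (a sketch, not the production logic):

    def self_agreement(responses: dict[str, str],
                       duplicate_pairs: list[tuple[str, str]]) -> float:
        """responses: item_id -> the tasker's answer.
        duplicate_pairs: near-duplicate items spaced across sessions.
        Returns the share of pairs where the tasker agreed with themselves."""
        scored = [(a, b) for a, b in duplicate_pairs
                  if a in responses and b in responses]
        if not scored:
            return 0.0
        return sum(responses[a] == responses[b] for a, b in scored) / len(scored)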
Is this RLHF, DPO, or something else?
Any of them. CogForce delivers per-item consensus weighted by reviewer calibration, plus disagreement structure and per-domain reviewer scores, usable directly for RLHF reward modeling, DPO preference pairs, or evaluation suites.
How does calibration compound?
Calibration carries between sessions and is per-domain. Strong reviewers unlock harder, better-paid work and start training the next layer. A tasker's score is portable, signed, and exportable.
Where does the work happen?
Anywhere. Tasks are designed to be small enough to do on a phone in five minutes — on the train, at a kitchen table, between meetings. No webcam, no shift schedules, no surveillance dashboards.
For AI labs

What you actually get back.

  • Per-item consensus, weighted by calibrated reviewer judgment.
  • Disagreement structure — where humans split, and along what lines.
  • Per-domain reviewer calibration for downstream RLHF or DPO training.
  • Audit trails: who answered, how confident, how their calibration was earned.
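
Concretely, a single delivered item could look like the following record; every field name here is an assumption for illustration, not a published export format.

    delivered_item = {
        "item_id": "cmp-00481",
        "consensus": {"choice": "A", "weighted_share": 0.83},   # calibration-weighted
        "disagreement": {
            "split": {"A": 0.83, "B": 0.17},
            "axis": "brevity vs. warmth",                       # where humans divide
        },
        "reviewers": [
            {"id": "r-210", "choice": "A", "confidence": 0.9,
             "calibration": {"tone_warmth": 0.91}},             # per-domain, auditable
        ],
    }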