True Positive
Last reviewed by Moderation API
A true positive in content moderation is an instance where harmful or policy-violating content is correctly identified and acted on by the moderation system. True positives are the cases the model gets right — they drive precision and recall.
