Dataset: titanic
Round ID: 245 Prompt used: Evaluate whether the treatment will be a success or failure based on the following rules. Apply the rules in sequential order and stop at the first matching rule: Rule 1: If Histogen_Complex is Beta AND Sex is female AND TcQ_mass > 50,000: - If Cohort is Delhi AND Treatment_Months < 90, predict Success. - Otherwise, predict Failure. Rule 2: If Histogen_Complex is Beta AND TcQ_mass > 45,000: - If Sex is male, predict Success. - If Sex is female AND Treatment_Months < 85, predict Success. - Otherwise, predict Failure. Rule 3: If Histogen_Complex is Beta AND TcQ_mass between 30,000 and 45,000: - If Cohort is Delhi, predict Success. - If Sex is male, predict Success. - If Sex is female AND Cohort is Melbourne AND Treatment_Months < 90, predict Success. - Otherwise, predict Failure. Rule 4: If Histogen_Complex is Beta AND TcQ_mass between 25,000 and 30,000: - If Cohort is Delhi, predict Success. - If Treatment_Months < 90 AND Sex is male, predict Success. - If Cohort is Melbourne AND Sex is female AND Treatment_Months < 85, predict Success. - Otherwise, predict Failure. Rule 5: If Histogen_Complex is Beta AND TcQ_mass < 25,000, predict Failure. Rule 6: If Histogen_Complex is Delta AND Sex is female AND TcQ_mass > 50,000: - If Cohort is Melbourne AND Treatment_Months < 80, predict Success. - Otherwise, predict Failure. Rule 7: If Histogen_Complex is Delta AND Genetic_Class_A_Matches + Genetic_Class_B_Matches > 4: - If TcQ_mass < 50,000, predict Failure. - Otherwise, predict Failure. Rule 8: If Histogen_Complex is Delta AND Sex is male AND TcQ_mass between 7,500 and 25,000: - If Cohort is Lisbon AND Treatment_Months < 70, predict Success. - If Cohort is Delhi AND Treatment_Months < 60, predict Success. - If Cohort is Melbourne AND Treatment_Months < 50, predict Success. - Otherwise, predict Failure. Rule 9: If Histogen_Complex is Delta AND Sex is female AND TcQ_mass between 7,800 and 8,200: - If Cohort is Delhi AND Treatment_Months < 65, predict Success. - Otherwise, predict Failure. Rule 10: If Histogen_Complex is Delta AND TcQ_mass < 7,800: - If Cohort is Delhi AND Treatment_Months < 85, predict Success. - If Cohort is Lisbon AND Treatment_Months < 60, predict Success. - If Sex is female AND Genetic_Class_A_Matches >= 2 AND Treatment_Months < 65, predict Success. - Otherwise, predict Failure. Rule 11: If Histogen_Complex is Omicron AND Sex is female: - If Treatment_Months < 60 AND Genetic_Class_A_Matches + Genetic_Class_B_Matches >= 5, predict Success. - Otherwise, predict Failure. Rule 12: If Histogen_Complex is Omicron AND Treatment_Months > 150: - If Sex is male AND Genetic_Class_A_Matches + Genetic_Class_B_Matches >= 4, predict Success. - Otherwise, predict Failure. Rule 13: If Histogen_Complex is Omicron AND Treatment_Months between 120 and 150: - If Sex is male AND Genetic_Class_A_Matches + Genetic_Class_B_Matches >= 3, predict Success. - Otherwise, predict Failure. Rule 14: If Histogen_Complex is Omicron AND Sex is male AND Treatment_Months < 120: - If Genetic_Class_A_Matches >= 2 AND Genetic_Class_B_Matches >= 2, predict Success. - If Treatment_Months < 50 AND (Genetic_Class_A_Matches >= 2 OR Genetic_Class_B_Matches >= 2), predict Success. - Otherwise, predict Failure. Rule 15: If Histogen_Complex is Delta AND Cohort is Delhi AND TcQ_mass > 10,000: - If Treatment_Months < 60, predict Success. - Otherwise, predict Failure. Rule 16: If Cohort is Delhi AND TcQ_mass > 40,000 AND Treatment_Months < 60, predict Success unless contradicted by Rules 6 or 7. Rule 17: If Cohort is Delhi AND TcQ_mass < 10,000 AND Treatment_Months < 70, predict Success. Rule 18: If Histogen_Complex is Delta AND Sex is male AND Cohort is Lisbon AND TcQ_mass < 9,000 AND Treatment_Months < 60, predict Success. Rule 19: If Histogen_Complex is Omicron AND Sex is male AND Treatment_Months < 60 AND Genetic_Class_A_Matches >= 1 AND Genetic_Class_B_Matches >= 2, predict Success. Rule 20: If none of the above rules apply, predict Failure. Confusion Matrix: Predicted Failure Predicted Success Actual Failure 26 2 Actual Success 8 15 Accuracy: 0.804 Precision: 0.765 Recall: 0.929 F1 Score: 0.839 Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure) Entity Data: Histogen_Complex: Omicron Sex: female Treatment_Months: 162.0 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 1 TcQ_mass: 14000.0 Cohort: Melbourne Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success) Entity Data: Histogen_Complex: Delta Sex: male Treatment_Months: 27.0 Genetic_Class_A_Matches: 2 Genetic_Class_B_Matches: 2 TcQ_mass: 15245.8 Cohort: Delhi Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure) Entity Data: Histogen_Complex: Delta Sex: female Treatment_Months: 89.09735294117647 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 1 TcQ_mass: 56495.8 Cohort: Melbourne Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success) Entity Data: Histogen_Complex: Beta Sex: male Treatment_Months: 89.09735294117647 Genetic_Class_A_Matches: 2 Genetic_Class_B_Matches: 1 TcQ_mass: 52000.0 Cohort: Melbourne
Round ID: 277 Prompt used: Task: Classify each entity as "Success" or "Failure" using the rules below. Output exactly one of these two words. Deterministic classification rules (apply in order): 1. If Histogen_Complex is Beta or Omicron, classify as Success. 2. Otherwise (all other Histogen_Complex values, e.g. Delta, Alpha, etc.), classify as Failure. Do not rely on any other features. Do not explain the answer. Just output the single word label. Confusion Matrix: Predicted Failure Predicted Success Actual Failure 17 11 Actual Success 5 18 Accuracy: 0.686 Precision: 0.773 Recall: 0.607 F1 Score: 0.680 Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure) Entity Data: Histogen_Complex: Delta Sex: female Treatment_Months: 93.0 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 1 TcQ_mass: 7775.0 Cohort: Melbourne Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success) Entity Data: Histogen_Complex: Beta Sex: female Treatment_Months: 89.09735294117647 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 1 TcQ_mass: 221779.2 Cohort: Melbourne Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure) Entity Data: Histogen_Complex: Delta Sex: male Treatment_Months: 89.09735294117647 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 1 TcQ_mass: 7879.2 Cohort: Lisbon Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success) Entity Data: Histogen_Complex: Beta Sex: male Treatment_Months: 153.0 Genetic_Class_A_Matches: 2 Genetic_Class_B_Matches: 1 TcQ_mass: 77958.29999999999 Cohort: Melbourne
Round ID: 296 Prompt used: You are given one entity at a time with the following fields. • Histogen_Complex (string) • Sex ("male" or "female") • Treatment_Months (number, can be decimal) • Genetic_Class_A_Matches (integer ≥0) • Genetic_Class_B_Matches (integer ≥0) • TcQ_mass (number, can be decimal) • Cohort (string) Your task is to predict the treatment OUTCOME for that entity. Only two outcomes are possible: Success Failure Apply the rules below IN ORDER. As soon as a rule is satisfied, output the associated outcome and stop – do not check the lower-priority rules. The rules are designed to be mutually exclusive and cover every possible row. Rule 1 HIGH GENETIC BURDEN → Failure Let total_matches = Genetic_Class_A_Matches + Genetic_Class_B_Matches. If total_matches ≥ 6, predict Failure. Rule 2 EXTREME TcQ_mass → Success If TcQ_mass > 200 000, predict Success. Rule 3 MALE DEFAULT → Success If Sex is "male", predict Success. Rule 4 SHORT TREATMENT WINDOW FOR FEMALES → Success If Sex is "female" AND Treatment_Months < 24, predict Success. Rule 5 OTHERWISE → Failure All remaining cases (i.e., Sex "female" with Treatment_Months ≥ 24 and that did not match any earlier rule) are predicted as Failure. Output exactly one word – either Success or Failure – with nothing else. Confusion Matrix: Predicted Failure Predicted Success Actual Failure 24 4 Actual Success 7 16 Accuracy: 0.784 Precision: 0.774 Recall: 0.857 F1 Score: 0.814 Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure) Entity Data: Histogen_Complex: Delta Sex: female Treatment_Months: 136.5 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 1 TcQ_mass: 7225.0 Cohort: Delhi Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success) Entity Data: Histogen_Complex: Delta Sex: male Treatment_Months: 89.09735294117647 Genetic_Class_A_Matches: 2 Genetic_Class_B_Matches: 3 TcQ_mass: 23450.0 Cohort: Melbourne Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure) Entity Data: Histogen_Complex: Beta Sex: female Treatment_Months: 111.0 Genetic_Class_A_Matches: 2 Genetic_Class_B_Matches: 2 TcQ_mass: 52554.200000000004 Cohort: Melbourne Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success) Entity Data: Histogen_Complex: Omicron Sex: male Treatment_Months: 24.0 Genetic_Class_A_Matches: 1 Genetic_Class_B_Matches: 3 TcQ_mass: 26250.0 Cohort: Melbourne
Predicted + | Predicted - | |
---|---|---|
Actual + | 25 | 3 |
Actual - | 6 | 17 |
Accuracy 0.824, Precision 0.806, Recall 0.893, F1 0.847