Dataset: titanic
Round ID: 255
Prompt used:
If Genetic_Class_A_Matches is greater than 4, predict failure. If TcQ_mass is less than 8000, predict failure. If TcQ_mass is greater than 25000 and Genetic_Class_A_Matches is less than 2, predict success, otherwise predict failure.
Confusion Matrix:
Predicted Failure Predicted Success
Actual Failure 24 4
Actual Success 18 5
Accuracy: 0.569
Precision: 0.571
Recall: 0.857
F1 Score: 0.686
Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure)
Entity Data:
Histogen_Complex: Delta
Sex: female
Treatment_Months: 63.0
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 7250.0
Cohort: Melbourne
Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success)
Entity Data:
Histogen_Complex: Beta
Sex: female
Treatment_Months: 89.09735294117647
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 221779.2
Cohort: Melbourne
Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure)
Entity Data:
Histogen_Complex: Omicron
Sex: male
Treatment_Months: 72.0
Genetic_Class_A_Matches: 3
Genetic_Class_B_Matches: 4
TcQ_mass: 18750.0
Cohort: Melbourne
Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success)
Entity Data:
Histogen_Complex: Omicron
Sex: male
Treatment_Months: 24.0
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 3
TcQ_mass: 26250.0
Cohort: Melbourne
Round ID: 277
Prompt used:
Task: Classify each entity as "Success" or "Failure" using the rules below. Output exactly one of these two words.
Deterministic classification rules (apply in order):
1. If Histogen_Complex is Beta or Omicron, classify as Success.
2. Otherwise (all other Histogen_Complex values, e.g. Delta, Alpha, etc.), classify as Failure.
Do not rely on any other features. Do not explain the answer. Just output the single word label.
Confusion Matrix:
Predicted Failure Predicted Success
Actual Failure 17 11
Actual Success 5 18
Accuracy: 0.686
Precision: 0.773
Recall: 0.607
F1 Score: 0.680
Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure)
Entity Data:
Histogen_Complex: Delta
Sex: female
Treatment_Months: 90.0
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 7895.8
Cohort: Melbourne
Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success)
Entity Data:
Histogen_Complex: Beta
Sex: female
Treatment_Months: 89.09735294117647
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 221779.2
Cohort: Melbourne
Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure)
Entity Data:
Histogen_Complex: Delta
Sex: female
Treatment_Months: 89.09735294117647
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 56495.8
Cohort: Melbourne
Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success)
Entity Data:
Histogen_Complex: Omicron
Sex: male
Treatment_Months: 24.0
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 3
TcQ_mass: 26250.0
Cohort: Melbourne
Round ID: 296
Prompt used:
You are given one entity at a time with the following fields.
• Histogen_Complex (string)
• Sex ("male" or "female")
• Treatment_Months (number, can be decimal)
• Genetic_Class_A_Matches (integer ≥0)
• Genetic_Class_B_Matches (integer ≥0)
• TcQ_mass (number, can be decimal)
• Cohort (string)
Your task is to predict the treatment OUTCOME for that entity.
Only two outcomes are possible:
Success
Failure
Apply the rules below IN ORDER. As soon as a rule is satisfied, output the associated outcome and stop – do not check the lower-priority rules. The rules are designed to be mutually exclusive and cover every possible row.
Rule 1 HIGH GENETIC BURDEN → Failure
Let total_matches = Genetic_Class_A_Matches + Genetic_Class_B_Matches.
If total_matches ≥ 6, predict Failure.
Rule 2 EXTREME TcQ_mass → Success
If TcQ_mass > 200 000, predict Success.
Rule 3 MALE DEFAULT → Success
If Sex is "male", predict Success.
Rule 4 SHORT TREATMENT WINDOW FOR FEMALES → Success
If Sex is "female" AND Treatment_Months < 24, predict Success.
Rule 5 OTHERWISE → Failure
All remaining cases (i.e., Sex "female" with Treatment_Months ≥ 24 and that did not match any earlier rule) are predicted as Failure.
Output exactly one word – either Success or Failure – with nothing else.
Confusion Matrix:
Predicted Failure Predicted Success
Actual Failure 24 4
Actual Success 7 16
Accuracy: 0.784
Precision: 0.774
Recall: 0.857
F1 Score: 0.814
Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure)
Entity Data:
Histogen_Complex: Omicron
Sex: female
Treatment_Months: 99.0
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 12275.0
Cohort: Melbourne
Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success)
Entity Data:
Histogen_Complex: Beta
Sex: female
Treatment_Months: 89.09735294117647
Genetic_Class_A_Matches: 1
Genetic_Class_B_Matches: 1
TcQ_mass: 221779.2
Cohort: Melbourne
Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure)
Entity Data:
Histogen_Complex: Omicron
Sex: female
Treatment_Months: 96.0
Genetic_Class_A_Matches: 2
Genetic_Class_B_Matches: 1
TcQ_mass: 26000.0
Cohort: Melbourne
Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success)
Entity Data:
Histogen_Complex: Omicron
Sex: male
Treatment_Months: 72.0
Genetic_Class_A_Matches: 3
Genetic_Class_B_Matches: 4
TcQ_mass: 18750.0
Cohort: Melbourne
| Predicted + | Predicted - | |
|---|---|---|
| Actual + | 25 | 3 |
| Actual - | 9 | 14 |
Accuracy 0.765, Precision 0.735, Recall 0.893, F1 0.806