Ensemble

Dataset: titanic

Models

Model Narratives

anthropic37

Round ID: 95
Prompt used:
	Evaluate the patient's outcome using the following deterministic rules:
	
	RULE 1: IF Histogen_Complex is Delta AND Sex is male AND Cohort is Melbourne AND Genetic_Class_A_Matches >= 3 AND TcQ_mass > 15000, THEN predict Failure.
	
	RULE 2: IF Histogen_Complex is Delta AND Sex is female AND (Cohort is Delhi OR Cohort is Lisbon) AND Genetic_Class_A_Matches >= 2, THEN predict Failure.
	
	RULE 3: IF Histogen_Complex is Delta AND Sex is male AND Cohort is Delhi AND Treatment_Months >= 75, THEN predict Failure.
	
	RULE 4: IF the patient belongs to the Delhi cohort AND Treatment_Months < 70 AND Genetic_Class_A_Matches >= 2 AND Histogen_Complex is NOT Delta, THEN predict Success.
	
	RULE 5: IF Histogen_Complex is Delta AND Sex is female AND Genetic_Class_A_Matches = 1 AND Genetic_Class_B_Matches = 1 AND Cohort is Melbourne, THEN:
	   - If TcQ_mass is between 7500-7890 AND Treatment_Months > 90, predict Success.
	   - Otherwise, predict Failure.
	
	RULE 6: IF Histogen_Complex is Beta AND Cohort is Delhi, THEN:
	   - If TcQ_mass > 50000 AND Sex is male, predict Success.
	   - If TcQ_mass > 250000, predict Success.
	   - Otherwise, predict Failure.
	
	RULE 7: IF Treatment_Months > 90 AND Genetic_Class_A_Matches = 1 AND Genetic_Class_B_Matches <= 2, THEN:
	   - If Histogen_Complex is Omicron AND Sex is male AND TcQ_mass < 15000, predict Success.
	   - Otherwise, predict Failure.
	
	RULE 8: IF Sex is male AND Genetic_Class_A_Matches >= 2 AND Treatment_Months < 75 AND Histogen_Complex is NOT Delta, THEN predict Success.
	
	RULE 9: IF Sex is male AND Genetic_Class_A_Matches = 1 AND Genetic_Class_B_Matches <= 2 AND Cohort is Melbourne AND Histogen_Complex is Delta, THEN:
	   - If TcQ_mass < 8500 AND Treatment_Months < 75, predict Success.
	   - Otherwise, predict Failure.
	
	RULE 10: IF Sex is female AND Treatment_Months > 85 AND Histogen_Complex is Delta AND Cohort is Melbourne, THEN predict Failure.
	
	RULE 11: IF Histogen_Complex is Delta AND Sex is male AND Genetic_Class_B_Matches >= 5, THEN predict Failure.
	
	RULE 12: IF Histogen_Complex is Delta AND Sex is male AND Treatment_Months < 50 AND Cohort is Delhi, THEN predict Success.
	
	RULE 13: IF Histogen_Complex is Delta AND Sex is female AND Treatment_Months < 15 AND Cohort is Melbourne AND Genetic_Class_A_Matches <= 2, THEN predict Success.
	
	RULE 14: IF Histogen_Complex is Beta AND Sex is male, THEN:
	   - If Genetic_Class_A_Matches >= 3, predict Success.
	   - If Cohort is Melbourne AND TcQ_mass > 50000, predict Success.
	   - If Cohort is Melbourne AND Treatment_Months < 60 AND Genetic_Class_A_Matches >= 2, predict Success.
	   - If Cohort is Melbourne AND Genetic_Class_B_Matches <= 3 AND Treatment_Months > 100, predict Success.
	   - Otherwise, predict Failure.
	
	RULE 15: IF Histogen_Complex is Omicron AND Sex is male AND Treatment_Months < 70, THEN predict Success.
	
	RULE 16: IF Histogen_Complex is Omicron AND Sex is female, THEN:
	   - If Treatment_Months < 90, predict Failure.
	   - If TcQ_mass > 10000 AND Treatment_Months >= 90, predict Failure.
	   - Otherwise, predict Success.
	
	RULE 17: IF Histogen_Complex is Delta AND Sex is male AND Cohort is Melbourne AND TcQ_mass < 8500 AND Treatment_Months > 75, THEN predict Success.
	
	DEFAULT RULE: If none of the above rules apply, predict Failure.
	
	Apply these rules sequentially in the order they are presented. Once a rule matches, use its prediction and stop checking further rules.
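
As a reading aid, the rule list above can be sketched as a first-match-wins function. This is a literal transcription under stated assumptions, not the evaluation code behind this report: the function name `predict_round_95` and the dict-based entity layout are hypothetical, and "between 7500-7890" in Rule 5 is assumed to be inclusive at both ends. The recorded predictions presumably come from a model following the prompt, so a literal evaluation need not reproduce every one of them.

```python
def predict_round_95(e):
    """Literal first-match-wins reading of the 17 rules; e is a dict of entity fields."""
    hc, sex, cohort = e["Histogen_Complex"], e["Sex"], e["Cohort"]
    months, mass = e["Treatment_Months"], e["TcQ_mass"]
    a, b = e["Genetic_Class_A_Matches"], e["Genetic_Class_B_Matches"]
    delta, male, female = hc == "Delta", sex == "male", sex == "female"
    melbourne, delhi = cohort == "Melbourne", cohort == "Delhi"
    S, F = "Success", "Failure"

    if delta and male and melbourne and a >= 3 and mass > 15000:  # Rule 1
        return F
    if delta and female and cohort in ("Delhi", "Lisbon") and a >= 2:  # Rule 2
        return F
    if delta and male and delhi and months >= 75:  # Rule 3
        return F
    if delhi and months < 70 and a >= 2 and not delta:  # Rule 4
        return S
    if delta and female and a == 1 and b == 1 and melbourne:  # Rule 5
        # Assumption: the 7500-7890 range is inclusive.
        return S if 7500 <= mass <= 7890 and months > 90 else F
    if hc == "Beta" and delhi:  # Rule 6
        return S if (mass > 50000 and male) or mass > 250000 else F
    if months > 90 and a == 1 and b <= 2:  # Rule 7
        return S if hc == "Omicron" and male and mass < 15000 else F
    if male and a >= 2 and months < 75 and not delta:  # Rule 8
        return S
    if male and a == 1 and b <= 2 and melbourne and delta:  # Rule 9
        return S if mass < 8500 and months < 75 else F
    if female and months > 85 and delta and melbourne:  # Rule 10
        return F
    if delta and male and b >= 5:  # Rule 11
        return F
    if delta and male and months < 50 and delhi:  # Rule 12
        return S
    if delta and female and months < 15 and melbourne and a <= 2:  # Rule 13
        return S
    if hc == "Beta" and male:  # Rule 14
        ok = (a >= 3
              or (melbourne and mass > 50000)
              or (melbourne and months < 60 and a >= 2)
              or (melbourne and b <= 3 and months > 100))
        return S if ok else F
    if hc == "Omicron" and male and months < 70:  # Rule 15
        return S
    if hc == "Omicron" and female:  # Rule 16
        if months < 90:
            return F
        if mass > 10000 and months >= 90:
            return F
        return S
    if delta and male and melbourne and mass < 8500 and months > 75:  # Rule 17
        return S
    return F  # Default rule


# Entity taken from this round's "Correctly predicted Success" example.
example = {
    "Histogen_Complex": "Omicron", "Sex": "male", "Treatment_Months": 42.0,
    "Genetic_Class_A_Matches": 2, "Genetic_Class_B_Matches": 1,
    "TcQ_mass": 30070.8, "Cohort": "Delhi",
}
print(predict_round_95(example))  # Success (Rule 4 is the first match)
```

Early returns keep the "stop at the first matching rule" semantics explicit: a rule with sub-branches, such as Rule 5 or Rule 14, always returns once its guard matches, so later rules are never consulted for that entity.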

Confusion Matrix:
                Predicted Failure    Predicted Success   
Actual Failure                    24                    4
Actual Success                    13                   10

Accuracy: 0.667
Precision: 0.649
Recall: 0.857
F1 Score: 0.738
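
The four scores above are consistent with treating Failure as the positive class. A minimal sketch of that arithmetic follows; the function name `metrics` and its argument names are illustrative and do not come from the report.

```python
def metrics(tp, fn, fp, tn):
    """Accuracy, precision, recall and F1 from confusion-matrix counts, rounded to 3 places."""
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return tuple(round(x, 3) for x in (accuracy, precision, recall, f1))


# Round 95 counts with Failure as the positive class:
# tp = actual Failure predicted Failure, fn = actual Failure predicted Success,
# fp = actual Success predicted Failure, tn = actual Success predicted Success.
print(metrics(tp=24, fn=4, fp=13, tn=10))  # (0.667, 0.649, 0.857, 0.738)
```

The same function reproduces the scores reported for the other rounds when given their matrix counts in the same order.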

Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure)
  Entity Data:
	Histogen_Complex: Omicron
	Sex: female
	Treatment_Months: 84.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 13000.0
	Cohort: Melbourne


Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success)
  Entity Data:
	Histogen_Complex: Omicron
	Sex: female
	Treatment_Months: 81.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 26000.0
	Cohort: Melbourne


Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure)
  Entity Data:
	Histogen_Complex: Delta
	Sex: female
	Treatment_Months: 135.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 8050.000000000001
	Cohort: Melbourne


Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success)
  Entity Data:
	Histogen_Complex: Omicron
	Sex: male
	Treatment_Months: 42.0
	Genetic_Class_A_Matches: 2
	Genetic_Class_B_Matches: 1
	TcQ_mass: 30070.8
	Cohort: Delhi


gemini

Round ID: 265
Prompt used:
	If Histogen_Complex is Omicron, predict Success.
	If Genetic_Class_A_Matches is 1, and Genetic_Class_B_Matches is 1, and Histogen_Complex is Delta, and TcQ_mass is greater than 10000, predict Failure. Otherwise, if Genetic_Class_A_Matches is 1, and Genetic_Class_B_Matches is 1, and Histogen_Complex is Delta, predict Failure.
	Otherwise, predict Success.
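
Read literally, the two Delta branches in this prompt both predict Failure, so the TcQ_mass threshold has no effect on the outcome. A small sketch under that reading follows; the function name `predict_round_265` and the dict-based entity layout are hypothetical.

```python
def predict_round_265(e):
    """Literal reading of the three-line prompt; e is a dict of entity fields."""
    if e["Histogen_Complex"] == "Omicron":
        return "Success"
    if (e["Genetic_Class_A_Matches"] == 1
            and e["Genetic_Class_B_Matches"] == 1
            and e["Histogen_Complex"] == "Delta"):
        # Both the TcQ_mass > 10000 branch and its "otherwise" branch
        # predict Failure, so TcQ_mass is not checked here.
        return "Failure"
    return "Success"


print(predict_round_265({
    "Histogen_Complex": "Delta",
    "Genetic_Class_A_Matches": 1,
    "Genetic_Class_B_Matches": 1,
    "TcQ_mass": 8662.5,
}))  # Failure
```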

Confusion Matrix:
                Predicted Failure    Predicted Success   
Actual Failure                    14                   14
Actual Success                     3                   20

Accuracy: 0.667
Precision: 0.824
Recall: 0.500
F1 Score: 0.622

Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure)
  Entity Data:
	Histogen_Complex: Delta
	Sex: female
	Treatment_Months: 60.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 8662.5
	Cohort: Melbourne


Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success)
  Entity Data:
	Histogen_Complex: Omicron
	Sex: female
	Treatment_Months: 81.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 26000.0
	Cohort: Melbourne


Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure)
  Entity Data:
	Histogen_Complex: Delta
	Sex: male
	Treatment_Months: 89.09735294117647
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 7879.2
	Cohort: Lisbon


Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success)
  Entity Data:
	Histogen_Complex: Beta
	Sex: female
	Treatment_Months: 75.0
	Genetic_Class_A_Matches: 2
	Genetic_Class_B_Matches: 1
	TcQ_mass: 55441.7
	Cohort: Delhi


openaio310

Round ID: 296
Prompt used:
	You are given one entity at a time with the following fields.
	• Histogen_Complex  (string)
	• Sex               ("male" or "female")
	• Treatment_Months  (number, can be decimal)
	• Genetic_Class_A_Matches (integer ≥0)
	• Genetic_Class_B_Matches (integer ≥0)
	• TcQ_mass          (number, can be decimal)
	• Cohort            (string)
	
	Your task is to predict the treatment OUTCOME for that entity.
	Only two outcomes are possible:
	  Success
	  Failure
	
	Apply the rules below IN ORDER. As soon as a rule is satisfied, output the associated outcome and stop – do not check the lower-priority rules. The rules are designed to be mutually exclusive and cover every possible row.
	
	Rule 1  HIGH GENETIC BURDEN → Failure
	  Let total_matches = Genetic_Class_A_Matches + Genetic_Class_B_Matches.
	  If total_matches ≥ 6, predict Failure.
	
	Rule 2  EXTREME TcQ_mass → Success
	  If TcQ_mass > 200 000, predict Success.
	
	Rule 3  MALE DEFAULT → Success
	  If Sex is "male", predict Success.
	
	Rule 4  SHORT TREATMENT WINDOW FOR FEMALES → Success
	  If Sex is "female" AND Treatment_Months < 24, predict Success.
	
	Rule 5  OTHERWISE → Failure
	  All remaining cases (i.e., Sex "female" with Treatment_Months ≥ 24 and that did not match any earlier rule) are predicted as Failure.
	
	Output exactly one word – either Success or Failure – with nothing else.
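
The five ordered rules in this prompt translate directly into a short function. The sketch below is a literal transcription; the name `predict_round_296` and the dict-based entity layout are hypothetical and not part of the report.

```python
def predict_round_296(e):
    """First-match-wins reading of Rules 1-5; e is a dict of entity fields."""
    total_matches = e["Genetic_Class_A_Matches"] + e["Genetic_Class_B_Matches"]
    if total_matches >= 6:            # Rule 1: high genetic burden
        return "Failure"
    if e["TcQ_mass"] > 200_000:       # Rule 2: extreme TcQ_mass
        return "Success"
    if e["Sex"] == "male":            # Rule 3: male default
        return "Success"
    if e["Sex"] == "female" and e["Treatment_Months"] < 24:  # Rule 4
        return "Success"
    return "Failure"                  # Rule 5: everything else


# Entity taken from this round's "Correctly predicted Failure" example.
print(predict_round_296({
    "Sex": "female", "Treatment_Months": 90.0,
    "Genetic_Class_A_Matches": 1, "Genetic_Class_B_Matches": 1,
    "TcQ_mass": 7895.8,
}))  # Failure
```

Histogen_Complex and Cohort are listed as input fields, but none of the five rules reads them, so the sketch ignores them.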

Confusion Matrix:
                Predicted Failure    Predicted Success   
Actual Failure                    24                    4
Actual Success                     7                   16

Accuracy: 0.784
Precision: 0.774
Recall: 0.857
F1 Score: 0.814

Examples for Correctly predicted Failure: (Correct answer: Failure, What the previous set of rules predicted: Failure)
  Entity Data:
	Histogen_Complex: Delta
	Sex: female
	Treatment_Months: 90.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 7895.8
	Cohort: Melbourne


Examples for Falsely predicted Success when it should have been Failure: (Correct answer: Failure, What the previous set of rules predicted: Success)
  Entity Data:
	Histogen_Complex: Delta
	Sex: male
	Treatment_Months: 120.0
	Genetic_Class_A_Matches: 2
	Genetic_Class_B_Matches: 1
	TcQ_mass: 9475.0
	Cohort: Melbourne


Examples for Falsely predicted Failure when it should have been Success: (Correct answer: Success, What the previous set of rules predicted: Failure)
  Entity Data:
	Histogen_Complex: Delta
	Sex: female
	Treatment_Months: 89.09735294117647
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 56495.8
	Cohort: Melbourne


Examples for Correctly predicted Success: (Correct answer: Success, What the previous set of rules predicted: Success)
  Entity Data:
	Histogen_Complex: Beta
	Sex: male
	Treatment_Months: 90.0
	Genetic_Class_A_Matches: 1
	Genetic_Class_B_Matches: 1
	TcQ_mass: 86500.0
	Cohort: Melbourne


Ensemble Confusion Matrix

                Predicted +          Predicted -
Actual +                          23                    5
Actual -                           7                   16

Accuracy: 0.765
Precision: 0.767
Recall: 0.821
F1 Score: 0.793
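
The report does not state how the three models' predictions are combined into the ensemble result. One common choice is a simple majority vote, sketched below purely as an assumption; the function name `majority_vote` is hypothetical, and the actual ensemble may use a different scheme.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the most frequent label among the per-model predictions.

    Assumption: every model casts one equally weighted vote. With three
    models and two labels a tie cannot occur.
    """
    return Counter(predictions).most_common(1)[0][0]


# Hypothetical per-model outputs for one entity, in the order
# anthropic37, gemini, openaio310.
print(majority_vote(["Failure", "Success", "Failure"]))  # Failure
```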