Ensemble

Dataset: southgermancredit

Models

Model Narratives

openaio3

Round ID: 306
Prompt used:
	You are given ONE reef‑survey record.  Decide whether the reef is "Healthy" or "Stressed" by following the steps below EXACTLY and DETERMINISTICALLY.  Output ONLY the single word Healthy or Stressed.
	
	STEP 0 – HEALTH OVERRIDE (apply first)
	If ALL of the following are true, immediately output Healthy:
	  • ReefIntegrityScore = Excellent
	  • PredatorActivityLevel is "None" or "Very Low"
	  • AcousticIntensity < 300
	  • BiodiversityIndex = High
	
	STEP 1 – STRESS OVERRIDES (apply next; if any match, output Stressed)
	A)  PollutionLevel is High or Critical AND InvasiveSpeciesDetected = Yes AND ReefIntegrityScore is Moderate, Poor or Very Poor
	B)  BleachingEventsPerYear = Constant AND AcousticIntensity < 500
	
	If neither override fired, continue to Step 2.
	
	STEP 2 – ASSIGN RISK POINTS
	(a) BleachingEventsPerYear
	      Rare = 0   Occasional = 1   Frequent = 2   Constant = 2
	(b) PollutionLevel
	      Low = 0   Moderate = 1   High = 3   Critical = 4
	(c) ReefIntegrityScore
	      Excellent = 0   Good = 1   Moderate = 2   Poor = 3   Very Poor = 4
	(d) PredatorActivityLevel   (treat "Moderate Low" as Moderate; treat "High", "Severe" or "Extreme" the same)
	      None / Very Low = 0   Moderate / Moderate Low = 1   High / Severe / Extreme = 1
	(e) InvasiveSpeciesDetected   Yes = 1   No = 0
	(f) AcousticIntensity (numeric)
	      > 900 = 2   500–900 = 1   < 500 = 0
	(g) DistantStressIndicators
	      Confirmed = 1   Possible or Absent = 0
	(h) PreviousStressIncidents
	      None / Minor = 0   Moderate = 1   Severe or worse = 2
	
	STEP 3 – CALCULATE TOTAL RISK
	Add all points from Step 2.
	
	STEP 4 – FINAL DECISION
	If TOTAL RISK ≥ 9  →  Stressed
	Otherwise            →  Healthy
	
	Do not add any explanations or extra words.

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    37                   51
Actual Stressed                    6                   19

Accuracy: 0.496
Precision: 0.860
Recall: 0.420
F1 Score: 0.565

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 70
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: Low
	AcousticIntensity: 276.5
	AlgalCoverage: Heavy
	CoralAgeEstimate: 21
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: High
	ReefAverageAge: 35
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: Minor
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 43
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 241.0
	AlgalCoverage: Extensive
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Constant
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 4-6 years
	PollutionLevel: High
	ReefAverageAge: 53
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 55
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: None
	AcousticIntensity: 264.5
	AlgalCoverage: Minimal
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Occasional
	BiodiversityIndex: High
	NearbyHealthyReef: Adjacent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Moderate
	ReefAverageAge: 46
	DistantStressIndicators: Confirmed
	ReefDepthZone: Shallow
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: No


Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 25
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 1457.5
	AlgalCoverage: Extensive
	CoralAgeEstimate: 6
	BleachingEventsPerYear: Rare
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Moderate
	ReefAverageAge: 26
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Branching
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


opus4010

Round ID: 96
Prompt used:
	Predict whether a coral reef is "Healthy" or "Stressed" based on the following rules:
	
	A reef should be classified as "Stressed" if ANY of these conditions are met:
	
	1. ReefIntegrityScore is "Poor" OR "Very Poor"
	
	2. CoralAgeEstimate is less than or equal to 11 years AND ReefAverageAge is less than 30 years
	
	3. ObservationDuration is greater than 100 days
	
	4. CurrentFlowQuality is "Poor" AND ReefIntegrityScore is "Moderate" or worse
	
	5. PredatorActivityLevel is "High" OR "Severe"
	
	6. ReefAverageAge is less than 25 years AND CurrentFlowQuality is not "Excellent"
	
	7. CoralAgeEstimate is 6 years or less
	
	Otherwise, classify the reef as "Healthy".
	
	When evaluating conditions:
	- For ReefIntegrityScore, the order from best to worst is: Excellent > Good > Moderate > Poor > Very Poor
	- For CurrentFlowQuality, the order from best to worst is: Excellent > Good > Fair > Poor
	- For numeric values, use exact comparisons (e.g., "less than or equal to 11" means ≤ 11)

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    53                   35
Actual Stressed                    6                   19

Accuracy: 0.637
Precision: 0.898
Recall: 0.602
F1 Score: 0.721

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Good
	ObservationDuration: 79
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 517.2
	AlgalCoverage: Minimal
	CoralAgeEstimate: 21
	BleachingEventsPerYear: Constant
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: High
	ReefAverageAge: 28
	DistantStressIndicators: None
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 187
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 1016.4
	AlgalCoverage: Sparse
	CoralAgeEstimate: 21
	BleachingEventsPerYear: Occasional
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Low
	ReefAverageAge: 24
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 67
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: None
	AcousticIntensity: 225.5
	AlgalCoverage: Minimal
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Very High
	NearbyHealthyReef: Adjacent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Moderate
	ReefAverageAge: 36
	DistantStressIndicators: None
	ReefDepthZone: Shallow
	PreviousStressIncidents: Minor
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: No


Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 61
	ReefIntegrityScore: Good
	PredatorActivityLevel: Severe
	AcousticIntensity: 218.9
	AlgalCoverage: Minimal
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Very High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: High
	ReefAverageAge: 31
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


openai35

Round ID: 349
Prompt used:
	Given the entity data for a reef observation, predict the reef status as Healthy or Stressed based on the following rule: If Observation Duration is less than or equal to 50, Predator Activity Level is Severe, Bleaching Events Per Year is Frequent, and Reef Integrity Score is Poor or Fair, predict Stressed. Otherwise, predict Healthy.

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    88                    0
Actual Stressed                   25                    0

Accuracy: 0.779
Precision: 0.779
Recall: 1.000
F1 Score: 0.876

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 115
	ReefIntegrityScore: Good
	PredatorActivityLevel: None
	AcousticIntensity: 288.2
	AlgalCoverage: Sparse
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Constant
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 4-6 years
	PollutionLevel: Critical
	ReefAverageAge: 33
	DistantStressIndicators: Confirmed
	ReefDepthZone: Deep
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 151
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 677.8
	AlgalCoverage: Minimal
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: High
	ReefAverageAge: 34
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Ensemble Confusion Matrix

Predicted +Predicted -
Actual +6622
Actual -817

Accuracy 0.735, Precision 0.892, Recall 0.750, F1 0.815