Ensemble

Dataset: southgermancredit

Models

Model Narratives

anthropic3710

Round ID: 299
Prompt used:
	# Coral Reef Health Classification Rules
	
	Evaluate each coral reef using the following set of rules to determine if it is HEALTHY or STRESSED. Apply these rules sequentially and classify accordingly.
	
	## Rule 1: Current Flow and Integrity Assessment
	- If CurrentFlowQuality is "Poor" AND ReefIntegrityScore is "Very Poor", classify as STRESSED.
	- If CurrentFlowQuality is "Excellent" AND ReefIntegrityScore is "Excellent" or "Good", classify as HEALTHY unless contradicted by Rule 6.
	- If CurrentFlowQuality is "Poor" but ReefIntegrityScore is "Excellent", evaluate other positive factors before classification (see Rule 5).
	
	## Rule 2: Pollution and Biodiversity Assessment
	- If PollutionLevel is "Critical" AND BiodiversityIndex is "Low", classify as STRESSED.
	- If BiodiversityIndex is "High" or "Very High", consider the reef POTENTIALLY HEALTHY (confirm with Rules 5 and 6).
	- If PollutionLevel is "Critical" or "High" but BiodiversityIndex is "High" or "Very High" AND ReefAverageAge > 35, consider the reef POTENTIALLY HEALTHY (confirm with Rules 5 and 6).
	
	## Rule 3: Reef and Coral Age Evaluation
	- If CoralAgeEstimate ≤ 11 AND ReefAverageAge < 28, classify as STRESSED unless BiodiversityIndex is "High" or "Very High".
	- If CoralAgeEstimate ≥ 16 AND ReefAverageAge > 35 AND BiodiversityIndex is not "Low", consider the reef POTENTIALLY HEALTHY (confirm with Rules 5 and 6).
	
	## Rule 4: Bleaching and Algal Coverage Assessment
	- If BleachingEventsPerYear is "Constant" AND AlgalCoverage is "Extensive" AND BiodiversityIndex is "Low" or "Medium", classify as STRESSED.
	- If BleachingEventsPerYear is "Constant" but BiodiversityIndex is "High" or "Very High" AND ReefIntegrityScore is "Excellent" or "Good", this is not automatically indicative of stress (evaluate using Rules 5 and 6).
	
	## Rule 5: Positive Health Indicators
	If THREE OR MORE of the following conditions are met, classify as HEALTHY unless Rule 6 applies:
	- CurrentFlowQuality is "Excellent" or "Good"
	- BiodiversityIndex is "High" or "Very High"
	- AlgalCoverage is "Minimal" or "Sparse"
	- ReefIntegrityScore is "Excellent" or "Good"
	- PollutionLevel is "Low" or "Moderate"
	- ReefAverageAge > 35
	- CoralAgeEstimate > 15
	
	## Rule 6: Critical Stress Indicators
	If ANY THREE of the following conditions are met, classify as STRESSED regardless of other factors:
	- AcousticIntensity > 850
	- PollutionLevel is "Critical" AND BiodiversityIndex is not "High" or "Very High"
	- CurrentFlowQuality is "Poor" AND ReefIntegrityScore is "Very Poor" or "Poor"
	- BleachingEventsPerYear is "Frequent" AND AlgalCoverage is "Extensive"
	- CoralAgeEstimate < 10 AND ReefAverageAge < 25
	
	## Rule 7: Additional Health Indicators
	If a reef meets TWO of these conditions, classify as HEALTHY unless it meets Rule 6:
	- ReefIntegrityScore is "Excellent" AND any one other positive indicator from Rule 5
	- BiodiversityIndex is "High" or "Very High" AND ReefAverageAge > 35
	- BiodiversityIndex is "High" or "Very High" AND AlgalCoverage is "Minimal"
	
	## Default Rule
	If no clear classification has been made from the above rules:
	- If TWO OR MORE of these key indicators (BiodiversityIndex, ReefIntegrityScore, CurrentFlowQuality, PollutionLevel) are positive, AND AcousticIntensity < 600, classify as HEALTHY.
	- If TWO OR MORE of these key indicators are negative, OR AcousticIntensity > 600 AND either PollutionLevel is "High"/"Critical" or CurrentFlowQuality is "Poor", classify as STRESSED.
	- In uncertain cases where there is a balanced mix of positive and negative indicators, prioritize BiodiversityIndex and ReefIntegrityScore in the final decision.
	
	When evaluating, apply rules in sequence but prioritize Rules 5 and 6 when their conditions are conclusively met.
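
As a rough illustration of how Rules 5 and 6 interact, the sketch below counts the indicators each rule lists and applies the Rule 6 override first. It covers only these two rules, not the full prompt. The function names and the dict-based record are assumptions made for this example, and "ANY THREE" is read here as "at least three".

```python
HIGH_BIODIVERSITY = {"High", "Very High"}

def rule5_count(reef):
    """Count the Rule 5 positive health indicators met by a reef record."""
    checks = [
        reef["CurrentFlowQuality"] in {"Excellent", "Good"},
        reef["BiodiversityIndex"] in HIGH_BIODIVERSITY,
        reef["AlgalCoverage"] in {"Minimal", "Sparse"},
        reef["ReefIntegrityScore"] in {"Excellent", "Good"},
        reef["PollutionLevel"] in {"Low", "Moderate"},
        reef["ReefAverageAge"] > 35,
        reef["CoralAgeEstimate"] > 15,
    ]
    return sum(checks)

def rule6_count(reef):
    """Count the Rule 6 critical stress indicators met by a reef record."""
    checks = [
        reef["AcousticIntensity"] > 850,
        reef["PollutionLevel"] == "Critical"
        and reef["BiodiversityIndex"] not in HIGH_BIODIVERSITY,
        reef["CurrentFlowQuality"] == "Poor"
        and reef["ReefIntegrityScore"] in {"Very Poor", "Poor"},
        reef["BleachingEventsPerYear"] == "Frequent"
        and reef["AlgalCoverage"] == "Extensive",
        reef["CoralAgeEstimate"] < 10 and reef["ReefAverageAge"] < 25,
    ]
    return sum(checks)

def rules_5_and_6(reef):
    """Return 'STRESSED', 'HEALTHY', or None when neither rule is conclusive."""
    if rule6_count(reef) >= 3:   # Rule 6 overrides everything else
        return "STRESSED"
    if rule5_count(reef) >= 3:   # Rule 5 applies unless Rule 6 fired
        return "HEALTHY"
    return None

# Field values copied from one record this model classified as Healthy.
example = {
    "CurrentFlowQuality": "Fair",
    "ReefIntegrityScore": "Excellent",
    "AcousticIntensity": 155.7,
    "AlgalCoverage": "Extensive",
    "CoralAgeEstimate": 26,
    "BleachingEventsPerYear": "Constant",
    "BiodiversityIndex": "High",
    "PollutionLevel": "Low",
    "ReefAverageAge": 53,
}

print(rule5_count(example), rule6_count(example), rules_5_and_6(example))
# prints: 5 0 HEALTHY
```

With five Rule 5 indicators met and no Rule 6 indicators, this record is classified HEALTHY under the two rules alone.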

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    48                   40
Actual Stressed                   10                   15

Accuracy: 0.558
Precision: 0.828
Recall: 0.545
F1 Score: 0.658
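
The reported metrics are consistent with treating Healthy as the positive class. The following sketch recomputes them from the four cells of the confusion matrix above; the helper name `binary_metrics` is an assumption for illustration.

```python
def binary_metrics(tp, fn, fp, tn):
    """Compute accuracy, precision, recall and F1 from confusion-matrix cells."""
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    # Guard against empty denominators (e.g. a model that never predicts positive).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}

# Healthy is the positive class: TP=48, FN=40, FP=10, TN=15.
m = binary_metrics(tp=48, fn=40, fp=10, tn=15)
print({name: round(value, 3) for name, value in m.items()})
# prints: {'accuracy': 0.558, 'precision': 0.828, 'recall': 0.545, 'f1': 0.658}
```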

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 52
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 155.7
	AlgalCoverage: Extensive
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Constant
	BiodiversityIndex: High
	NearbyHealthyReef: Adjacent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Low
	ReefAverageAge: 53
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: Minor
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 187
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 1016.4
	AlgalCoverage: Sparse
	CoralAgeEstimate: 21
	BleachingEventsPerYear: Occasional
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Low
	ReefAverageAge: 24
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 67
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: None
	AcousticIntensity: 225.5
	AlgalCoverage: Minimal
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Very High
	NearbyHealthyReef: Adjacent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Moderate
	ReefAverageAge: 36
	DistantStressIndicators: None
	ReefDepthZone: Shallow
	PreviousStressIncidents: Minor
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: No


Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 79
	ReefIntegrityScore: Very Poor
	PredatorActivityLevel: Low
	AcousticIntensity: 413.0
	AlgalCoverage: Minimal
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Critical
	ReefAverageAge: 26
	DistantStressIndicators: None
	ReefDepthZone: Shallow
	PreviousStressIncidents: Minor
	CoralDominantType: Encrusting
	SurveyorExperience: No
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


openaio1

Round ID: 336
Prompt used:
	Please use the following explicit rules to classify each row as either "Healthy" or "Stressed" reef:
	
	1) If PredatorActivityLevel is "Severe," label the reef as "Stressed."
	2) If CurrentFlowQuality is "Poor" AND DistantStressIndicators is "Confirmed," label as "Stressed."  
	   (This helps catch reefs that are actually stressed but previously got labeled healthy.)
	3) If DistantStressIndicators is "None" AND PreviousStressIncidents is "None," label as "Healthy."  
	   (This addresses reefs that have only minor or no other risks but were mistakenly labeled stressed.)
	4) If CurrentFlowQuality is "Excellent" or "Good" AND PredatorActivityLevel is not "Severe," label as "Healthy." 
	5) In all other cases not covered by the rules above, label as "Stressed."  
	
	Make sure to apply the rules in the order listed. Only give a single classification—either "Healthy" or "Stressed"—for each entity.
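
Because the rules are applied in order and the first match wins, they map naturally onto a chain of early returns. The sketch below is one possible reading of the prompt; the function name `classify_reef` and the dict-based record are assumptions for illustration.

```python
def classify_reef(reef):
    """Apply the five rules in order; the first rule that matches decides."""
    # Rule 1: severe predator activity always means Stressed.
    if reef["PredatorActivityLevel"] == "Severe":
        return "Stressed"
    # Rule 2: poor flow combined with confirmed distant stress.
    if (reef["CurrentFlowQuality"] == "Poor"
            and reef["DistantStressIndicators"] == "Confirmed"):
        return "Stressed"
    # Rule 3: no distant stress and no previous incidents.
    if (reef["DistantStressIndicators"] == "None"
            and reef["PreviousStressIncidents"] == "None"):
        return "Healthy"
    # Rule 4: good flow; "not Severe" is already guaranteed by Rule 1.
    if reef["CurrentFlowQuality"] in {"Excellent", "Good"}:
        return "Healthy"
    # Rule 5: everything else.
    return "Stressed"

# Rule 3 fires before flow quality is considered, so a reef with poor
# flow but no recorded stress history is still labeled Healthy.
print(classify_reef({
    "PredatorActivityLevel": "Moderate Low",
    "CurrentFlowQuality": "Poor",
    "DistantStressIndicators": "None",
    "PreviousStressIncidents": "None",
}))
# prints: Healthy
```

The ordering matters: Rule 3 never inspects CurrentFlowQuality, so it can label a poor-flow reef Healthy before the fallback in Rule 5 is reached.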

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    41                   47
Actual Stressed                    4                   21

Accuracy: 0.549
Precision: 0.911
Recall: 0.466
F1 Score: 0.617

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 43
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: Low
	AcousticIntensity: 335.1
	AlgalCoverage: Minimal
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Occasional
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Moderate
	ReefAverageAge: 45
	DistantStressIndicators: Possible
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 43
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 72.1
	AlgalCoverage: Minimal
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Very High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Low
	ReefAverageAge: 43
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Massive
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 43
	ReefIntegrityScore: Poor
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 64.6
	AlgalCoverage: Minimal
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Low
	ReefAverageAge: 27
	DistantStressIndicators: None
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Massive
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 25
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 1457.5
	AlgalCoverage: Extensive
	CoralAgeEstimate: 6
	BleachingEventsPerYear: Rare
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Moderate
	ReefAverageAge: 26
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Branching
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


openai35

Round ID: 349
Prompt used:
	Given the entity data for a reef observation, predict the reef status as Healthy or Stressed based on the following rule: If Observation Duration is less than or equal to 50, Predator Activity Level is Severe, Bleaching Events Per Year is Frequent, and Reef Integrity Score is Poor or Fair, predict Stressed. Otherwise, predict Healthy.
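
This rule is a single conjunction: all four conditions must hold for a Stressed prediction, so a record that fails any one of them is labeled Healthy. A minimal sketch follows; the function name `predict_status` and both sample records are assumptions for illustration, and the second record is hypothetical.

```python
def predict_status(reef):
    """Predict Stressed only when all four conditions hold; otherwise Healthy."""
    stressed = (
        reef["ObservationDuration"] <= 50
        and reef["PredatorActivityLevel"] == "Severe"
        and reef["BleachingEventsPerYear"] == "Frequent"
        and reef["ReefIntegrityScore"] in {"Poor", "Fair"}
    )
    return "Stressed" if stressed else "Healthy"

# Meets the duration and bleaching conditions but not the other two.
partial = {
    "ObservationDuration": 34,
    "PredatorActivityLevel": "None",
    "BleachingEventsPerYear": "Frequent",
    "ReefIntegrityScore": "Moderate",
}
# Hypothetical record that satisfies every condition.
full = {
    "ObservationDuration": 34,
    "PredatorActivityLevel": "Severe",
    "BleachingEventsPerYear": "Frequent",
    "ReefIntegrityScore": "Poor",
}
print(predict_status(partial), predict_status(full))
# prints: Healthy Stressed
```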

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    88                    0
Actual Stressed                   25                    0

Accuracy: 0.779
Precision: 0.779
Recall: 1.000
F1 Score: 0.876

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 52
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 358.8
	AlgalCoverage: Minimal
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: High
	ReefAverageAge: 57
	DistantStressIndicators: None
	ReefDepthZone: Shallow
	PreviousStressIncidents: None
	CoralDominantType: Soft
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 34
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 144.2
	AlgalCoverage: Minimal
	CoralAgeEstimate: 11
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Critical
	ReefAverageAge: 30
	DistantStressIndicators: Confirmed
	ReefDepthZone: Deep
	PreviousStressIncidents: None
	CoralDominantType: Soft
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Ensemble Confusion Matrix

                Predicted +          Predicted -         
Actual +                          66                   22
Actual -                          12                   13

Accuracy: 0.699
Precision: 0.846
Recall: 0.750
F1 Score: 0.795
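
The report does not state how the three model predictions are combined into the ensemble. One common option is a simple majority vote, sketched here purely as an assumption; the function name `majority_vote` is hypothetical.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the label predicted by the most models."""
    counts = Counter(predictions)
    # most_common(1) yields the single (label, count) pair with the top count.
    return counts.most_common(1)[0][0]

# Per-row predictions in the order anthropic3710, openaio1, openai35.
print(majority_vote(["Stressed", "Healthy", "Healthy"]))   # prints: Healthy
print(majority_vote(["Stressed", "Stressed", "Healthy"]))  # prints: Stressed
```

Under this assumed scheme, a model that predicts Healthy for every row, as openai35 did here, means the ensemble outputs Stressed only when both other models agree on Stressed.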