Ensemble

Dataset: southgermancredit

Models

Model Narratives

gpt-4o-legacy10

Round ID: 325
Prompt used:
	Choose randomly

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    43                   45
Actual Stressed                   11                   14

Accuracy: 0.504
Precision: 0.796
Recall: 0.489
F1 Score: 0.606
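
The four metrics above follow from the confusion matrix if Healthy is treated as the positive class. That is an assumption: the report does not state it, but the reported numbers match under that reading. A minimal sketch, where `binary_metrics` is a hypothetical helper name and not part of the evaluation tooling:

```python
def binary_metrics(tp, fn, fp, tn):
    """Accuracy, precision, recall and F1 from the four confusion-matrix counts.

    Assumption: 'Healthy' is the positive class, so
    tp = Actual Healthy / Predicted Healthy,  fn = Actual Healthy / Predicted Stressed,
    fp = Actual Stressed / Predicted Healthy, tn = Actual Stressed / Predicted Stressed.
    """
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    # Guard against empty denominators (e.g. a model that never predicts Healthy).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Counts taken from the gpt-4o-legacy10 confusion matrix above.
metrics = binary_metrics(tp=43, fn=45, fp=11, tn=14)
print({name: round(value, 3) for name, value in metrics.items()})
```

Rounded to three decimals this gives 0.504, 0.796, 0.489 and 0.606, in the order listed above.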

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 151
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 850.7
	AlgalCoverage: Extensive
	CoralAgeEstimate: 21
	BleachingEventsPerYear: Rare
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: High
	ReefAverageAge: 27
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 61
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Low
	AcousticIntensity: 344.2
	AlgalCoverage: Minimal
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Constant
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Moderate
	ReefAverageAge: 50
	DistantStressIndicators: None
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: Moderate
	CoralDominantType: Encrusting
	SurveyorExperience: No
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Poor
	ObservationDuration: 79
	ReefIntegrityScore: Very Poor
	PredatorActivityLevel: Low
	AcousticIntensity: 413.0
	AlgalCoverage: Minimal
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Critical
	ReefAverageAge: 26
	DistantStressIndicators: None
	ReefDepthZone: Shallow
	PreviousStressIncidents: Minor
	CoralDominantType: Encrusting
	SurveyorExperience: No
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 169
	ReefIntegrityScore: Very Poor
	PredatorActivityLevel: Severe
	AcousticIntensity: 1596.5
	AlgalCoverage: Minimal
	CoralAgeEstimate: 11
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Critical
	ReefAverageAge: 61
	DistantStressIndicators: Confirmed
	ReefDepthZone: Shallow
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


openailong

Round ID: 215
Prompt used:
	Based on the following criteria, classify the reef as either 'Healthy' or 'Stressed':
	
	1. **Current Flow Quality:** 
	   - Excellent: +2 points
	   - Good: +1 point
	   - Fair: -1 point
	   - Poor: -2 points
	
	2. **Observation Duration:** 
	   - > 6 years: +2 points
	   - 4-6 years: +1 point
	   - 1-4 years: 0 points
	   - < 1 year: -1 point
	
	3. **Reef Integrity Score:** 
	   - Excellent: +3 points
	   - Good: +2 points
	   - Moderate: +1 point
	   - Poor or Very Poor: -1 point
	
	4. **Pollution Level:** 
	   - Low: +2 points
	   - Moderate: +1 point
	   - High: -2 points
	   - Critical: -3 points
	
	5. **Predator Activity Level:** 
	   - Very Low or None: +2 points
	   - Low: +1 point
	   - Moderate Low: 0 points
	   - Moderate: -1 point
	   - High or Severe: -2 points
	
	6. **Biodiversity Index:** 
	   - High: +2 points
	   - Medium: +1 point
	   - Low: -1 point
	
	7. **Nearby Healthy Reef Presence:** 
	   - Adjacent or Present: +2 points
	   - Absent: -2 points
	
	8. **Previous Stress Incidents:** 
	   - None: +2 points
	   - Minor: +1 point
	   - Moderate or Major: -3 points
	
	Calculate the total score. If the score is 6 or higher, classify the reef as 'Healthy.' If the score is less than 6, classify it as 'Stressed.'
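
The point scheme in this prompt can be written as lookup tables. The sketch below is a deterministic reading of the prompt, not the model's actual behaviour, and it rests on several assumptions: criterion 2 is read from the `ReefMonitoringDuration` field (the only field holding year ranges), values the prompt does not list (for example "Catastrophic", "Nearby" or "1-3 years") contribute 0 points, and the names `POINTS`, `score_reef` and `classify_reef` are hypothetical.

```python
# Point tables transcribed from the prompt. Unlisted values score 0 (assumption).
POINTS = {
    "CurrentFlowQuality": {"Excellent": 2, "Good": 1, "Fair": -1, "Poor": -2},
    # Assumption: the prompt's "Observation Duration" buckets map to this field.
    "ReefMonitoringDuration": {"> 6 years": 2, "4-6 years": 1, "1-4 years": 0, "< 1 year": -1},
    "ReefIntegrityScore": {"Excellent": 3, "Good": 2, "Moderate": 1, "Poor": -1, "Very Poor": -1},
    "PollutionLevel": {"Low": 2, "Moderate": 1, "High": -2, "Critical": -3},
    "PredatorActivityLevel": {
        "Very Low": 2, "None": 2, "Low": 1, "Moderate Low": 0,
        "Moderate": -1, "High": -2, "Severe": -2,
    },
    "BiodiversityIndex": {"High": 2, "Medium": 1, "Low": -1},
    "NearbyHealthyReef": {"Adjacent": 2, "Present": 2, "Absent": -2},
    "PreviousStressIncidents": {"None": 2, "Minor": 1, "Moderate": -3, "Major": -3},
}
THRESHOLD = 6  # "6 or higher" means Healthy


def score_reef(entity):
    """Sum the points of all eight criteria; missing or unlisted values add 0."""
    return sum(table.get(entity.get(field), 0) for field, table in POINTS.items())


def classify_reef(entity):
    return "Healthy" if score_reef(entity) >= THRESHOLD else "Stressed"


# Relevant fields of one record from this report's examples.
example = {
    "CurrentFlowQuality": "Good",
    "ReefMonitoringDuration": "4-6 years",
    "ReefIntegrityScore": "Moderate",
    "PollutionLevel": "High",
    "PredatorActivityLevel": "Moderate Low",
    "BiodiversityIndex": "High",
    "NearbyHealthyReef": "Absent",
    "PreviousStressIncidents": "None",
}
print(score_reef(example), classify_reef(example))
```

For this record the points are +1, +1, +1, -2, 0, +2, -2, +2, a total of 3, which is below the threshold and so yields Stressed. The language model applying the prompt is free to deviate from this arithmetic, so the sketch should not be expected to reproduce every prediction in the confusion matrix.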

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    43                   45
Actual Stressed                    7                   18

Accuracy: 0.540
Precision: 0.860
Recall: 0.489
F1 Score: 0.623

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 79
	ReefIntegrityScore: Excellent
	PredatorActivityLevel: Catastrophic
	AcousticIntensity: 633.4
	AlgalCoverage: Minimal
	CoralAgeEstimate: 6
	BleachingEventsPerYear: Constant
	BiodiversityIndex: High
	NearbyHealthyReef: Nearby
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Critical
	ReefAverageAge: 30
	DistantStressIndicators: None
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: Minor
	CoralDominantType: Soft
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Good
	ObservationDuration: 43
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Moderate Low
	AcousticIntensity: 341.9
	AlgalCoverage: Extensive
	CoralAgeEstimate: 26
	BleachingEventsPerYear: Occasional
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 4-6 years
	PollutionLevel: High
	ReefAverageAge: 40
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Soft
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 61
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 106.2
	AlgalCoverage: Extensive
	CoralAgeEstimate: 16
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 1-3 years
	PollutionLevel: Moderate
	ReefAverageAge: 36
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: No
	InvasiveSpeciesDetected: Yes


Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 169
	ReefIntegrityScore: Very Poor
	PredatorActivityLevel: Severe
	AcousticIntensity: 1596.5
	AlgalCoverage: Minimal
	CoralAgeEstimate: 11
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: High
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Critical
	ReefAverageAge: 61
	DistantStressIndicators: Confirmed
	ReefDepthZone: Shallow
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


openai35

Round ID: 349
Prompt used:
	Given the entity data for a reef observation, predict the reef status as Healthy or Stressed based on the following rule: If Observation Duration is less than or equal to 50, Predator Activity Level is Severe, Bleaching Events Per Year is Frequent, and Reef Integrity Score is Poor or Fair, predict Stressed. Otherwise, predict Healthy.
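
The rule in this prompt is a single conjunction: all four conditions must hold for a Stressed prediction. A minimal sketch of that logic, assuming the field names used in the entity data and a hypothetical function name `predict_status`:

```python
def predict_status(entity):
    """Literal reading of the prompt: Stressed only if all four conditions hold."""
    is_stressed = (
        entity["ObservationDuration"] <= 50
        and entity["PredatorActivityLevel"] == "Severe"
        and entity["BleachingEventsPerYear"] == "Frequent"
        and entity["ReefIntegrityScore"] in ("Poor", "Fair")
    )
    return "Stressed" if is_stressed else "Healthy"


# Relevant fields of a record from this report's examples.
observed = {
    "ObservationDuration": 61,
    "PredatorActivityLevel": "Very Low",
    "BleachingEventsPerYear": "Frequent",
    "ReefIntegrityScore": "Moderate",
}
# Made-up record (not from the dataset) that satisfies every condition.
hypothetical = {
    "ObservationDuration": 40,
    "PredatorActivityLevel": "Severe",
    "BleachingEventsPerYear": "Frequent",
    "ReefIntegrityScore": "Poor",
}
print(predict_status(observed), predict_status(hypothetical))
```

Because every condition has to be met at once, the rule fires rarely. That is consistent with the confusion matrix for this round, in which no record is predicted Stressed.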

Confusion Matrix:
                Predicted Healthy    Predicted Stressed  
Actual Healthy                    88                    0
Actual Stressed                   25                    0

Accuracy: 0.779
Precision: 0.779
Recall: 1.000
F1 Score: 0.876

Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Excellent
	ObservationDuration: 79
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: None
	AcousticIntensity: 154.5
	AlgalCoverage: Heavy
	CoralAgeEstimate: 21
	BleachingEventsPerYear: Constant
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: 4-6 years
	PollutionLevel: High
	ReefAverageAge: 37
	DistantStressIndicators: Confirmed
	ReefDepthZone: Mid-depth
	PreviousStressIncidents: None
	CoralDominantType: Encrusting
	SurveyorExperience: No
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy)
  Entity Data:
	CurrentFlowQuality: Fair
	ObservationDuration: 61
	ReefIntegrityScore: Moderate
	PredatorActivityLevel: Very Low
	AcousticIntensity: 1299.6
	AlgalCoverage: Minimal
	CoralAgeEstimate: 6
	BleachingEventsPerYear: Frequent
	BiodiversityIndex: Medium
	NearbyHealthyReef: Absent
	ReefMonitoringDuration: > 6 years
	PollutionLevel: Critical
	ReefAverageAge: 41
	DistantStressIndicators: Confirmed
	ReefDepthZone: Deep
	PreviousStressIncidents: None
	CoralDominantType: Soft
	SurveyorExperience: Yes
	RemoteSensorPresent: Yes
	InvasiveSpeciesDetected: Yes


Ensemble Confusion Matrix

                Predicted +          Predicted -         
Actual +                 67                   21
Actual -                 16                    9

Accuracy 0.673, Precision 0.807, Recall 0.761, F1 0.784
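
A quick consistency check on the four matrices in this report: if they all describe the same evaluation set, their row totals should agree. The sketch below assumes the ensemble counts are 67, 21, 16 and 9, and that "+" stands for Healthy. The dictionary `MATRICES` and the helper `row_totals` are hypothetical names.

```python
# Each matrix as (actual Healthy row, actual Stressed row);
# within a row: (predicted Healthy, predicted Stressed).
MATRICES = {
    "gpt-4o-legacy10": ((43, 45), (11, 14)),
    "openailong": ((43, 45), (7, 18)),
    "openai35": ((88, 0), (25, 0)),
    # Assumption: "+" in the ensemble matrix corresponds to Healthy.
    "ensemble": ((67, 21), (16, 9)),
}


def row_totals(matrix):
    """Number of records per actual class: (Healthy, Stressed)."""
    return tuple(sum(row) for row in matrix)


totals = {name: row_totals(matrix) for name, matrix in MATRICES.items()}
for name, per_class in totals.items():
    print(f"{name}: actual Healthy={per_class[0]}, actual Stressed={per_class[1]}, n={sum(per_class)}")
```

All four matrices give 88 actual Healthy and 25 actual Stressed records (113 in total), which supports reading "+" as Healthy in the ensemble matrix.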