Dataset: southgermancredit
Round ID: 306 Prompt used: You are given ONE reef‑survey record. Decide whether the reef is "Healthy" or "Stressed" by following the steps below EXACTLY and DETERMINISTICALLY. Output ONLY the single word Healthy or Stressed. STEP 0 – HEALTH OVERRIDE (apply first) If ALL of the following are true, immediately output Healthy: • ReefIntegrityScore = Excellent • PredatorActivityLevel is "None" or "Very Low" • AcousticIntensity < 300 • BiodiversityIndex = High STEP 1 – STRESS OVERRIDES (apply next; if any match, output Stressed) A) PollutionLevel is High or Critical AND InvasiveSpeciesDetected = Yes AND ReefIntegrityScore is Moderate, Poor or Very Poor B) BleachingEventsPerYear = Constant AND AcousticIntensity < 500 If neither override fired, continue to Step 2. STEP 2 – ASSIGN RISK POINTS (a) BleachingEventsPerYear Rare = 0 Occasional = 1 Frequent = 2 Constant = 2 (b) PollutionLevel Low = 0 Moderate = 1 High = 3 Critical = 4 (c) ReefIntegrityScore Excellent = 0 Good = 1 Moderate = 2 Poor = 3 Very Poor = 4 (d) PredatorActivityLevel (treat "Moderate Low" as Moderate; treat "High", "Severe" or "Extreme" the same) None / Very Low = 0 Moderate / Moderate Low = 1 High / Severe / Extreme = 1 (e) InvasiveSpeciesDetected Yes = 1 No = 0 (f) AcousticIntensity (numeric) > 900 = 2 500–900 = 1 < 500 = 0 (g) DistantStressIndicators Confirmed = 1 Possible or Absent = 0 (h) PreviousStressIncidents None / Minor = 0 Moderate = 1 Severe or worse = 2 STEP 3 – CALCULATE TOTAL RISK Add all points from Step 2. STEP 4 – FINAL DECISION If TOTAL RISK ≥ 9 → Stressed Otherwise → Healthy Do not add any explanations or extra words. Confusion Matrix: Predicted Healthy Predicted Stressed Actual Healthy 37 51 Actual Stressed 6 19 Accuracy: 0.496 Precision: 0.860 Recall: 0.420 F1 Score: 0.565 Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy) Entity Data: CurrentFlowQuality: Fair ObservationDuration: 70 ReefIntegrityScore: Excellent PredatorActivityLevel: Low AcousticIntensity: 276.5 AlgalCoverage: Heavy CoralAgeEstimate: 21 BleachingEventsPerYear: Frequent BiodiversityIndex: High NearbyHealthyReef: Absent ReefMonitoringDuration: 1-3 years PollutionLevel: High ReefAverageAge: 35 DistantStressIndicators: Confirmed ReefDepthZone: Mid-depth PreviousStressIncidents: Minor CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: Yes Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed) Entity Data: CurrentFlowQuality: Excellent ObservationDuration: 43 ReefIntegrityScore: Moderate PredatorActivityLevel: None AcousticIntensity: 241.0 AlgalCoverage: Extensive CoralAgeEstimate: 26 BleachingEventsPerYear: Constant BiodiversityIndex: High NearbyHealthyReef: Absent ReefMonitoringDuration: 4-6 years PollutionLevel: High ReefAverageAge: 53 DistantStressIndicators: Confirmed ReefDepthZone: Mid-depth PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: Yes Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy) Entity Data: CurrentFlowQuality: Poor ObservationDuration: 55 ReefIntegrityScore: Excellent PredatorActivityLevel: None AcousticIntensity: 264.5 AlgalCoverage: Minimal CoralAgeEstimate: 26 BleachingEventsPerYear: Occasional BiodiversityIndex: High NearbyHealthyReef: Adjacent ReefMonitoringDuration: > 6 years PollutionLevel: Moderate ReefAverageAge: 46 DistantStressIndicators: Confirmed ReefDepthZone: Shallow PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: No Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed) Entity Data: CurrentFlowQuality: Fair ObservationDuration: 25 ReefIntegrityScore: Moderate PredatorActivityLevel: None AcousticIntensity: 1457.5 AlgalCoverage: Extensive CoralAgeEstimate: 6 BleachingEventsPerYear: Rare BiodiversityIndex: High NearbyHealthyReef: Absent ReefMonitoringDuration: 1-3 years PollutionLevel: Moderate ReefAverageAge: 26 DistantStressIndicators: Confirmed ReefDepthZone: Mid-depth PreviousStressIncidents: None CoralDominantType: Branching SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: Yes
Round ID: 96 Prompt used: Predict whether a coral reef is "Healthy" or "Stressed" based on the following rules: A reef should be classified as "Stressed" if ANY of these conditions are met: 1. ReefIntegrityScore is "Poor" OR "Very Poor" 2. CoralAgeEstimate is less than or equal to 11 years AND ReefAverageAge is less than 30 years 3. ObservationDuration is greater than 100 days 4. CurrentFlowQuality is "Poor" AND ReefIntegrityScore is "Moderate" or worse 5. PredatorActivityLevel is "High" OR "Severe" 6. ReefAverageAge is less than 25 years AND CurrentFlowQuality is not "Excellent" 7. CoralAgeEstimate is 6 years or less Otherwise, classify the reef as "Healthy". When evaluating conditions: - For ReefIntegrityScore, the order from best to worst is: Excellent > Good > Moderate > Poor > Very Poor - For CurrentFlowQuality, the order from best to worst is: Excellent > Good > Fair > Poor - For numeric values, use exact comparisons (e.g., "less than or equal to 11" means ≤ 11) Confusion Matrix: Predicted Healthy Predicted Stressed Actual Healthy 53 35 Actual Stressed 6 19 Accuracy: 0.637 Precision: 0.898 Recall: 0.602 F1 Score: 0.721 Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy) Entity Data: CurrentFlowQuality: Good ObservationDuration: 79 ReefIntegrityScore: Moderate PredatorActivityLevel: Moderate Low AcousticIntensity: 517.2 AlgalCoverage: Minimal CoralAgeEstimate: 21 BleachingEventsPerYear: Constant BiodiversityIndex: High NearbyHealthyReef: Absent ReefMonitoringDuration: 1-3 years PollutionLevel: High ReefAverageAge: 28 DistantStressIndicators: None ReefDepthZone: Mid-depth PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: No InvasiveSpeciesDetected: Yes Examples for Falsely predicted Stressed when it should have been Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Stressed) Entity Data: CurrentFlowQuality: Excellent ObservationDuration: 187 ReefIntegrityScore: Moderate PredatorActivityLevel: Moderate Low AcousticIntensity: 1016.4 AlgalCoverage: Sparse CoralAgeEstimate: 21 BleachingEventsPerYear: Occasional BiodiversityIndex: Medium NearbyHealthyReef: Absent ReefMonitoringDuration: > 6 years PollutionLevel: Low ReefAverageAge: 24 DistantStressIndicators: Confirmed ReefDepthZone: Mid-depth PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: Yes Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy) Entity Data: CurrentFlowQuality: Poor ObservationDuration: 67 ReefIntegrityScore: Excellent PredatorActivityLevel: None AcousticIntensity: 225.5 AlgalCoverage: Minimal CoralAgeEstimate: 16 BleachingEventsPerYear: Constant BiodiversityIndex: Very High NearbyHealthyReef: Adjacent ReefMonitoringDuration: 1-3 years PollutionLevel: Moderate ReefAverageAge: 36 DistantStressIndicators: None ReefDepthZone: Shallow PreviousStressIncidents: Minor CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: No InvasiveSpeciesDetected: No Examples for Correctly predicted Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Stressed) Entity Data: CurrentFlowQuality: Excellent ObservationDuration: 61 ReefIntegrityScore: Good PredatorActivityLevel: Severe AcousticIntensity: 218.9 AlgalCoverage: Minimal CoralAgeEstimate: 16 BleachingEventsPerYear: Constant BiodiversityIndex: Very High NearbyHealthyReef: Absent ReefMonitoringDuration: 1-3 years PollutionLevel: High ReefAverageAge: 31 DistantStressIndicators: Confirmed ReefDepthZone: Mid-depth PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: Yes
Round ID: 349 Prompt used: Given the entity data for a reef observation, predict the reef status as Healthy or Stressed based on the following rule: If Observation Duration is less than or equal to 50, Predator Activity Level is Severe, Bleaching Events Per Year is Frequent, and Reef Integrity Score is Poor or Fair, predict Stressed. Otherwise, predict Healthy. Confusion Matrix: Predicted Healthy Predicted Stressed Actual Healthy 88 0 Actual Stressed 25 0 Accuracy: 0.779 Precision: 0.779 Recall: 1.000 F1 Score: 0.876 Examples for Correctly predicted Healthy: (Correct answer: Healthy, What the previous set of rules predicted: Healthy) Entity Data: CurrentFlowQuality: Fair ObservationDuration: 115 ReefIntegrityScore: Good PredatorActivityLevel: None AcousticIntensity: 288.2 AlgalCoverage: Sparse CoralAgeEstimate: 26 BleachingEventsPerYear: Constant BiodiversityIndex: High NearbyHealthyReef: Absent ReefMonitoringDuration: 4-6 years PollutionLevel: Critical ReefAverageAge: 33 DistantStressIndicators: Confirmed ReefDepthZone: Deep PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: No InvasiveSpeciesDetected: Yes Examples for Falsely predicted Healthy when it should have been Stressed: (Correct answer: Stressed, What the previous set of rules predicted: Healthy) Entity Data: CurrentFlowQuality: Poor ObservationDuration: 151 ReefIntegrityScore: Moderate PredatorActivityLevel: Moderate Low AcousticIntensity: 677.8 AlgalCoverage: Minimal CoralAgeEstimate: 16 BleachingEventsPerYear: Frequent BiodiversityIndex: Medium NearbyHealthyReef: Absent ReefMonitoringDuration: 1-3 years PollutionLevel: High ReefAverageAge: 34 DistantStressIndicators: Confirmed ReefDepthZone: Mid-depth PreviousStressIncidents: None CoralDominantType: Encrusting SurveyorExperience: Yes RemoteSensorPresent: Yes InvasiveSpeciesDetected: Yes
Predicted + | Predicted - | |
---|---|---|
Actual + | 66 | 22 |
Actual - | 8 | 17 |
Accuracy 0.735, Precision 0.892, Recall 0.750, F1 0.815