Model geminipro

Training model: gemini-2.0-pro-exp
Inference model: gpt-4o-mini

Investigations

Performance

DatasetValidation accuracyTest accuracy
espionage0.950.8
potions0.950.6
southgermancredit0.50.5221238938053098
wisconsin0.68253968253968260.5454545454545454