Model geminipro10

Training model: gemini-2.0-pro-exp
Inference model: gpt-4o-mini

Investigations

Performance

DatasetValidation accuracyTest accuracy
espionage0.650.4
potions0.750.65
timetravel_insurance0.950.8
wisconsin0.71428571428571430.5909090909090909