PRICOPE, Tidor-Vlad. “HardML: A Benchmark for Evaluating Data Science and Machine Learning Knowledge and Reasoning in AI”. Studia Universitatis Babeș-Bolyai Informatica, vol. 69, no. 2, Mar. 2025, pp. 59-76, doi:10.24193/subbi.2024.2.04.