It sounds like you're looking for information on (collections of email and password combinations from educational domains).
Use Have I Been Pwned to see if your EDU email is in a known combo list. Download EDU combo txt
While I've focused on the of these lists, were you instead looking for a technical explanation of how security researchers analyze these files, or perhaps a different type of educational dataset for data science? It sounds like you're looking for information on
Many companies offer "Student Discounts" or free software (Adobe, Office 365) to these addresses. Many companies offer "Student Discounts" or free software
These lists are usually sourced from old website leaks or phishing campaigns.
Hackers use these lists to "stuff" logins into other sites like Amazon or Netflix.
If you are a student or faculty member, take these steps to secure your EDU account: Enable Multi-Factor Authentication immediately.