A Framework for Formulation of Student Dataset Using Existing and Novel Features for Analysis
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: | INTI International University 2023 |
Subjects: | |
Online Access: | http://eprints.intimal.edu.my/1776/1/92 http://eprints.intimal.edu.my/1776/ https://intijournal.intimal.edu.my |
Summary: | One major problem identified in most schools in Nigeria is the lack of structured educational datasets composed of several attributes related to each student, such as term-based grades, courses taken, student-specific details, and absences, which could be easily analysed. This paper formulates a dataset with some novel features for analysing and predicting student performance. In addition to existing features such as age, grade, and number of failures, novel features consisting of environmental factors are proposed. Students' records were collected from schools, and data on the schools' infrastructure were gathered using a questionnaire. The data were analysed using NumPy and Pandas in Python. A random forest classifier was used to make predictions and to identify important features. The following features were found to influence the model's decisions: average, number of failures, students' scores in all subjects, school type, potable drinking water, availability of electricity, textbook-to-student ratio, and availability of laboratory reagents. Four of the proposed features were among the most important features. In addition, the model performed excellently at prediction. The analysis also shows that there are more males than females in the dataset, which suggests that government, non-governmental organizations, and society need to promote and encourage girl-child education. |
---|
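
The workflow described in the summary (tabular student records in Pandas, a random forest classifier, and a ranking of feature importances) can be sketched roughly as follows. This is a minimal illustration and not the authors' code: it assumes scikit-learn as the random forest implementation, which the record does not name, and the column names, the synthetic data, and the pass/fail rule are all hypothetical stand-ins for the real dataset.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 400

# Synthetic records for illustration only; every column name is hypothetical.
df = pd.DataFrame({
    "average": rng.uniform(20, 95, n),
    "num_failures": rng.integers(0, 5, n),
    "has_electricity": rng.integers(0, 2, n),
    "has_drinking_water": rng.integers(0, 2, n),
    "textbook_student_ratio": rng.uniform(0.1, 1.0, n),
})

# Assumed label for the sketch: a student passes when the average is at least 50.
df["passed"] = (df["average"] >= 50).astype(int)

X = df.drop(columns="passed")
y = df["passed"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Fit the classifier, then score it on the held-out split.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)

# Impurity-based importances, one per feature, sorted from most to least important.
importances = pd.Series(
    model.feature_importances_, index=X.columns
).sort_values(ascending=False)

print(f"held-out accuracy: {accuracy:.3f}")
print(importances)
```

Because the synthetic label depends only on `average`, that column dominates the ranking here. On real records the ranking would reflect whichever attributes actually separate the classes.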