Developing Multiagent E-Learning System-Based Machine Learning and Feature Selection Techniques

Recently, the domain of artificial intelligence (AI) has expanded to include finance, education, health, and mining. Artificial intelligence controls the performance of systems that use new technologies, especially in the education environment. The multiagent system (MAS) is considered an intelligent system that facilitates the e-learning process in the educational environment. MAS eases interaction among agents, which supports the use of feature selection. Feature selection methods select the important and relevant features from the database, which helps machine learning algorithms achieve high performance. This paper proposes an effective system that combines multiagents with machine learning algorithms and feature selection methods to enhance the e-learning process in the educational environment by predicting pass or fail results. The univariate and Extra Trees feature selection methods are used to select the essential attributes from the database. Five machine learning algorithms, namely Decision Tree (DT), Logistic Regression (LR), Random Forest (RF), Naive Bayes (NB), and K-nearest neighbors (KNN), are applied to all features and to the selected features. The results showed that the learning algorithm trained on the features selected by the Extra Trees method achieved the highest performance in both cross-validation and testing.


Introduction
During the last two years, global disasters have occurred, so people have been forced to use technologies to get their services remotely [1]. Technologies allow users to accomplish tasks at a low cost and save time. Artificial intelligence (AI) is a trending topic these days, and it allows machine learning to be implemented for efficiency and performance [2]. Education, health, industry, and finance use artificial intelligence to develop their fields. The rapidly growing education environment needs machine learning techniques, which are considered one of the faces of AI [3].
Enhancing learning systems, especially e-learning systems in the educational process, has become necessary for the educational environment. Our aim is to use an intelligent system to organize the e-learning process. Continuous changes in the e-learning process have led to the emergence of suitable techniques to deal with the requirements of students [4][5][6]. The e-learning system allows students to use the benefits of this application anywhere, anytime. An e-learning system needs to be supported by a multiagent model to cover the shortcomings of the educational environment. Since many attributes are used in the e-learning process, a multiagent system is considered the best solution for the e-learning system. Agents can interact with others in the same environment, so the multiagent system allows integration between agents [7,8]. The multiagent system allows e-learning attributes to interact and reveals their relationships. In the e-learning system, students can use many features to enhance their performance [9]. The proposed multiagent system could assist e-learning systems in improving various tasks for students. Artificial intelligence systems can enhance the performance of students by using feature selection methods [10,11].
Machine learning algorithms are used to identify and predict the data to produce the best solution for a decision [3]. Machine learning algorithms play an essential role in different fields [12,13], especially in the field of education [14,15], where they are used together with feature selection algorithms. Feature selection algorithms select only the relevant features for high prediction performance by using various algorithms [16,17]. Many feature selection algorithms can be used to improve efficiency by revealing irrelevant features.
This paper proposes an education system that uses multiagents to study the effects of interacting agents on enhancing e-learning. We integrated different agents (course, student, and different activities) and applied different feature selection methods to select the attributes that play the most important role in enhancing the e-learning process. We applied five machine learning algorithms to the selected features and evaluated their performance using different measurement methods to examine the effect of the feature selection methods on the performance of the educational process.
In the next section, the literature review covers related work on predicting performance using machine learning algorithms and the features that affect prediction in the education system. Section 3 presents the main steps of the proposed system. Section 4 discusses the results of applying ML algorithms to the selected features. Section 5 provides a summary of the paper.

Related Works
Machine learning (ML) is an implementation part of artificial intelligence (AI) that enables the machine to learn from data to complete the task efficiently. It is considered a backbone of artificial intelligence approaches that are used to develop predictions and enhance performance [2,18].
Feature selection (FS) is considered an important step before deploying machine learning algorithms [16,17]. Feature selection selects only the relevant and essential features from the data and ignores the redundant data [19]. Many researchers have used feature selection methods and machine learning algorithms to improve the educational process. For example, in [20], the authors proposed a learning system that implements a fuzzy methodology to detect the failure of students. The activity of students, their subjects, and their educational background are the factors that affect performance. They used a multicriteria fuzzy algorithm to rank students, and the rank is used to predict the score. The dataset covered 3 institutions and contained 131 students with 22 attributes.
In [21], the authors used machine learning algorithms to predict the performance of students in the faculty of Computer Science and Information Technology. They proposed supervised machine learning algorithms to predict examination results, working in two steps. The preprocessing step prepares and cleans the data, and the second step uses the machine learning algorithms to predict performance. They used several supervised algorithms, and the results showed that the logistic regression classifier obtained the best results for 498 students.
In [22], the authors proposed a decision tree algorithm and compared it with three other algorithms. They used Weka tools and tested the collected data to predict failure and success. They tested the features that affect accuracy, using the model results to identify relevant features. Feature selection retained five relevant features out of ten. After using several popular machine learning algorithms (J48, Random Tree, and RepTree), they recommended the decision tree algorithm as the best solution for high accuracy.
In [19], the authors studied feature selection with supervised machine learning algorithms for higher education. They used the Weka mining tool, one of the most popular tools for mining, in their experiments. The dataset consists of 11 features selected out of 45 features to predict the student's residence country, and the models were trained and tested with different methods. They used K-Fold, Hold-Out, and Leave One Out; the results showed that Leave One Out obtained high accuracy with Random Forest and that the GRAE algorithm enhanced the accuracy and obtained the highest accuracy.
In [23], the authors proposed a Generalized Feature Selection (GeFeS) method based on a machine learning genetic algorithm to choose a subset of features that are unique and important. In this study, the method used an efficient and fast prediction method to optimize performance for high accuracy and to minimize cost. A Genetic Algorithm (GA) with a sequence of operators was used to make the selection more relevant and intelligent. The operators in GA increase its capability, which allows it to deal with varied datasets (small and large scale). This method succeeded in increasing the accuracy and the F-measure, and the results were compared with other feature selection methods. The proposed algorithm showed high performance compared with previous methods applied to the same datasets.
In [24], two feature selection methods (CHI and MI) are combined to evaluate the scores of features. The new feature scores were normalized and then used to measure the performance of the student in the education process, since the student is considered an important agent among the multiagents found in the educational sector. This study compared different predictive models and reported the accuracy of each model.

Methodology
The proposed multiagent framework-based e-learning educational system is shown in Figure 1. It consists of the following steps: data collection, preprocessing the dataset, integrating the dataset, feature selection, splitting the dataset, training and optimizing ML algorithms, and evaluating ML algorithms. Each step is described below.
Computational Intelligence and Neuroscience

Data Collection.
We used the Open University Learning Analytics dataset [25] for our experiments. This dataset contains seven agents described as CSV files, and each file contains a table with several features:
(1) Courses: the courses that students study per semester.
(2) Assessment of students: the results of all assessments, submitted after being completed by students.
(3) Information about students: the student's basic information.
(4) Registration of students: the date on which students are allowed to register for the course.
(5) Virtual learning environment of students: the recorded interactions of students with each course.
(6) Virtual learning environment: the course materials, which are available in different types and styles of learning; each student can access them, and the activity of students is recorded.
(7) Assessment: the evaluation of students during the semester, which contains the results of all submitted assignments.
The dataset includes many learning and activity types that can be applied to students in each course. The collected dataset records the interactions of 32,593 students with 19 activity types and their styles in 22 courses. In our work, we study the impact of integrating four agents: courses, students' information, virtual learning environment of students, and their VLE. These integrated agents are used to improve the learning system in the educational process. The following sections describe the preprocessing steps applied to the dataset and propose a multiagent e-learning system that integrates the four agents. Table 1 describes each agent and its attributes.

Preprocessing Dataset.
This paper addresses the problem by converting it into a binary classification problem. The student's info table includes a class label with four values: pass, fail, withdrawn, and distinction. Distinction is converted into pass, and withdrawn is converted into fail. We integrated the student's vle table with the VLE table into one table, called student's learning style and activity: the names of the learning and activity types in the VLE table are extracted and added as attributes to the student's vle table, and the attribute values are filled with the total number of clicks for each student in a course.
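The preprocessing described above can be sketched with pandas as follows. This is a minimal illustration on toy data, not the authors' code: the column names (`final_result`, `id_site`, `activity_type`, `sum_click`) and the 1/0 encoding of pass/fail are assumptions.

```python
import pandas as pd

# Toy stand-ins for three of the tables; all column names are assumed.
student_info = pd.DataFrame({
    "id_student": [1, 2, 3, 4],
    "final_result": ["Pass", "Distinction", "Withdrawn", "Fail"],
})
student_vle = pd.DataFrame({
    "id_student": [1, 1, 2, 3],
    "code_module": ["AAA"] * 4,
    "code_presentation": ["2013J"] * 4,
    "id_site": [10, 11, 10, 11],
    "sum_click": [5, 3, 7, 2],
})
vle = pd.DataFrame({"id_site": [10, 11], "activity_type": ["quiz", "forumng"]})

# Binary label: distinction counts as pass (1), withdrawn counts as fail (0).
label_map = {"Pass": 1, "Distinction": 1, "Fail": 0, "Withdrawn": 0}
student_info["label"] = student_info["final_result"].map(label_map)

# Attach the activity type to each click record, then pivot so that every
# activity type becomes a column holding the student's total clicks.
clicks = student_vle.merge(vle, on="id_site", how="left")
activity = clicks.pivot_table(
    index=["id_student", "code_module", "code_presentation"],
    columns="activity_type",
    values="sum_click",
    aggfunc="sum",
    fill_value=0,
).reset_index()
```

Each row of `activity` then corresponds to one student in one course presentation, with one click-count column per activity type.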

Integrating Dataset.
The tables have been combined using a left join. The student's learning style and activity table is integrated with the student's info table, which contains the attributes "id student," "code module," and "code presentation," by implementing a left join.
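A left join of this kind can be sketched with `pandas.DataFrame.merge`. The toy tables, the underscore-style column names, and the choice of the activity table as the left side are assumptions made for illustration.

```python
import pandas as pd

# Toy activity table (left side); student 3 has no matching info row.
activity = pd.DataFrame({
    "id_student": [1, 2, 3],
    "code_module": ["AAA", "AAA", "AAA"],
    "code_presentation": ["2013J", "2013J", "2013J"],
    "quiz": [5, 7, 0],
})
# Toy student's info table (right side) carrying the class label.
student_info = pd.DataFrame({
    "id_student": [1, 2],
    "code_module": ["AAA", "AAA"],
    "code_presentation": ["2013J", "2013J"],
    "label": [1, 0],
})

# A left join keeps every row of the left table; rows without a match
# on all three keys get missing values in the right-hand columns.
keys = ["id_student", "code_module", "code_presentation"]
integrated = activity.merge(student_info, on=keys, how="left")
```

Joining on all three keys keeps records of the same student in different courses or presentations apart.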

Feature Selection Methods.
The key advantage of employing feature selection techniques is that they identify and select the most essential and highest-ranked features from the dataset. Feature selection methods based on machine learning algorithms are used to achieve the best performance. Two methods are used, namely, the univariate and Extra Trees feature selection methods:
(i) Univariate feature selection selects the best features from all features using univariate statistical tests. In this method, each feature receives its own rank and score, so it is easy to select the highest-scoring features as the best features.
(ii) Extra Trees builds each decision tree from the original data sample. At each test node, each tree is provided with a random subset of features, from which the decision tree selects the relevant feature for splitting according to a mathematical criterion [26,27].
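The two selection methods can be sketched with scikit-learn as follows. The source does not name the univariate test, so the chi-squared test used here is an assumption (it requires non-negative features such as click counts); the synthetic data and the feature names are hypothetical.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.feature_selection import SelectKBest, chi2

# Synthetic click counts: feature 0 is strongly related to the label,
# feature 1 weakly, and the remaining features are noise.
rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=400)
X = rng.poisson(lam=5.0, size=(400, 6))
X[:, 0] += 10 * y
X[:, 1] += 3 * y
names = [f"activity_{i}" for i in range(6)]  # hypothetical feature names

# (i) Univariate selection: score each feature independently.
selector = SelectKBest(score_func=chi2, k=2).fit(X, y)
univariate_scores = dict(zip(names, selector.scores_))

# (ii) Extra Trees: rank features by impurity-based importance.
forest = ExtraTreesClassifier(n_estimators=200, random_state=0).fit(X, y)
importances = dict(zip(names, forest.feature_importances_))

top_univariate = sorted(univariate_scores, key=univariate_scores.get, reverse=True)[:2]
top_extra_trees = sorted(importances, key=importances.get, reverse=True)[:2]
```

The Extra Trees importances sum to 1, so they can be read as each feature's share of the ranking, whereas univariate scores are unbounded test statistics.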

Splitting Dataset.
The integrated dataset is partitioned into a training set and a testing set. The training set is used to optimize ML algorithms by implementing grid search and stratified cross-validation. The testing set is used to evaluate the performance of ML algorithms using four measures: accuracy (A), precision (P), recall (R), and F-measure (F). The results of cross-validation and testing are recorded for each ML algorithm.
Training and Optimizing ML Algorithms.
Grid search with cross-validation (CV) is used to optimize ML algorithms and enhance their performance. Grid search is a technique for determining the best hyperparameters for ML algorithms in order to achieve the best results. CV splits the dataset into k subsets so that ML algorithms are trained on k-1 subsets (the training set) and the remaining subset (the testing subset) is used to test them. ML algorithms are used to develop a multiagent e-learning system. These algorithms are as follows:
(1) Naive Bayes (NB) classifier is a supervised classification approach that assumes the features are independent of one another. NB estimates the relevant parameters, so it is considered one of the strong classification techniques for relevant output [28].
(2) Random Forest (RF) is a machine learning model used for classification problems because of its flexibility. It builds many decision trees in the training step and then averages the predictions of the trees. Random Forest was used to estimate the accuracy in the exploratory data analysis (EDA) step and can deal with large datasets. It is an effective way to deal with a very large number of features and to obtain algorithm-based estimates of the features [29,30].
(3) Decision Tree (DT) is a supervised classification algorithm and one of the most popular machine learning algorithms. It builds a graph of nodes and branches in which each internal node represents a test on a feature and each leaf is assigned a class label. DT follows a top-down approach that starts from the root of the tree, and each branch carries the outcome of its node's test toward the decision on the label [28,31]. The root splits into branches to make the prediction (decision) [28]. This algorithm is one of the most common algorithms and can address a problem in a process that identifies the solution accurately and quickly.
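The split and the four evaluation measures can be sketched as follows. The synthetic data, the logistic regression model, the stratified 80/20 split, and the `average="weighted"` setting are assumptions for illustration, not details confirmed by the source.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split

# Synthetic binary classification data standing in for the integrated table.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# Hold out 20% as unseen test data; stratify keeps the pass/fail ratio
# the same in both parts (an assumption, see the lead-in).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
y_pred = model.predict(X_test)

# Accuracy (A), precision (P), recall (R), and F-measure (F) on the test set.
scores = {
    "A": accuracy_score(y_test, y_pred),
    "P": precision_score(y_test, y_pred, average="weighted"),
    "R": recall_score(y_test, y_pred, average="weighted"),
    "F": f1_score(y_test, y_pred, average="weighted"),
}
```

With weighted averaging, recall coincides with accuracy by construction, which is worth keeping in mind when comparing the two columns of a results table.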
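Grid search with stratified cross-validation can be sketched with scikit-learn's `GridSearchCV`. The hyperparameter grid, the number of folds (k = 5), and the synthetic data are assumptions; the source does not report the values it searched.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold

# Synthetic training data standing in for the training set.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Hypothetical grid: every combination (2 x 2 = 4) is evaluated.
param_grid = {"n_estimators": [25, 50], "max_depth": [3, None]}

# Stratified k-fold: each model is trained on k-1 folds and validated
# on the remaining fold, with class ratios preserved in every fold.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    cv=cv,
    scoring="accuracy",
)
search.fit(X, y)

best_params = search.best_params_        # best hyperparameter combination
best_cv_accuracy = search.best_score_    # mean validation accuracy of that combination
```

After fitting, `search.best_estimator_` is refitted on the whole training set and can be evaluated once on the held-out test set.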

Experiment Setup.
This paper's experiments were run with Python 3. ML models were implemented using the scikit-learn package. ML algorithms are optimized using grid search with cross-validation. The dataset was partitioned into two parts: an 80% training set for optimizing models and recording cross-validation results and a 20% testing set (unseen data) for evaluating models and recording testing results. We conducted various experiments to study the effect of learning and activity types in the educational process using feature selection methods based on five ML algorithms: DT, KNN, NB, LR, and RF. First, feature selection methods were applied to the database to determine the important features. Second, ML algorithms were applied to the full feature set. Third, ML algorithms were applied to the top thirteen features that recorded the highest scores. Fourth, as another experiment, ML algorithms were applied to the top six features that have the highest scores or rankings. The results of cross-validation and testing were recorded using accuracy (A), precision (P), recall (R), and F-measure (F).
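The comparison across feature subsets can be sketched as a loop over the five algorithms and the three feature counts. This is an illustration on synthetic data with default hyperparameters; the Gaussian variant of Naive Bayes, the 5-fold cross-validation, and the use of the Extra Trees ranking for both subsets are assumptions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Synthetic data with 20 features standing in for the integrated table.
X, y = make_classification(n_samples=300, n_features=20, n_informative=5, random_state=0)

# Rank features once, from most to least important.
ranking = ExtraTreesClassifier(n_estimators=100, random_state=0).fit(X, y)
order = np.argsort(ranking.feature_importances_)[::-1]

models = {
    "DT": DecisionTreeClassifier(random_state=0),
    "KNN": KNeighborsClassifier(),
    "NB": GaussianNB(),
    "LR": LogisticRegression(max_iter=1000),
    "RF": RandomForestClassifier(n_estimators=50, random_state=0),
}

# Mean cross-validated accuracy for full, top-13, and top-6 features.
results = {}
for k in (X.shape[1], 13, 6):
    cols = order[:k]
    for name, model in models.items():
        results[(name, k)] = cross_val_score(model, X[:, cols], y, cv=5).mean()
```

In this sketch the ranking is computed on all rows before cross-validation, mirroring the order of steps in the text; computing it inside each training fold would give a stricter estimate.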

Results of Applying Feature Selection Methods.
In this section, we will describe the results of applying feature selection methods: univariate and Extra Trees on the database.

Univariate Feature Selection Method.
The univariate method assigns a score to each feature, and we selected the best features based on high scores. Table 2 shows the scores of all features after applying the univariate method to the dataset. The oucontent activity registered the highest score at 5494843.899. The forumng activity registered the second-highest score at 3793119.894. The html activity registered the lowest score among activities at 1012.523433. Code presentation registered the lowest score among all features at 0.377850061.

Extra Trees Feature Selection Method.
Extra Trees assigns a ranking to each feature, and we selected the best features based on high ranking. Figure 2 shows the ranking of all features after applying Extra Trees to the dataset. The homepage and quiz activities have the highest rankings at 12.5 and 12.24, respectively. The repeat activity registered the lowest rank at 0.01. Resource, url, and code module have similar ranks at 6.78, 6.61, and 6.15, respectively.

Results of Applying ML Algorithms to Full Features.
ML algorithms have been applied to full features, and the results of cross-validation and testing performance of applying ML algorithms have been recorded as shown in

Results of Applying ML Algorithms to Thirteen Features.
After applying the two feature selection methods, thirteen features were selected because of their high ranking by Extra Trees. The top 13 features with high ranking are homepage, quiz, oucontent, subpage, forumng, resource, url, code module, ouwiki, oucollaborate, page, questionnaire, and glossary. ML algorithms have been applied to these 13 features, and the results of cross-validation and testing performance have been recorded as shown in Table 5.

Results of Applying ML Algorithms to Six Selected Features.
After applying two feature selection methods, six features with high ranking or scores have been selected. ML algorithms have been applied and the results of cross-validation and testing have been recorded.

Six Selected Features by Univariate.
The top six features with high scores, namely oucontent, forumng, quiz, homepage, subpage, and ouwiki, have been selected. ML algorithms have been applied to these six features, and the results of cross-validation and testing performance have been recorded as shown in Table 6.

Six Selected Features by Extra Trees.
The top six features with high ranking, namely homepage, quiz, oucontent, subpage, forumng, and resource, have been selected. ML algorithms have been applied to these six features, and the results of cross-validation and testing performance have been recorded as shown in Table 7.

Conclusion
This paper proposed a multiagent e-learning system to examine the interactions between agents that affect the e-learning process in the educational environment. The proposed framework consists of the following steps: data collection, data preprocessing, integrating multiagents, feature selection, and training and optimizing ML algorithms, in addition to evaluating the performance of ML algorithms. In the integration step, the agents, represented as the tables course, student's info, student's vle, and VLE, were combined into one table using a left join. In the feature selection step, the univariate and Extra Trees Classifier feature selection methods are used to select the attributes that are most relevant and play an important role in enhancing our multiagent framework. Different machine learning algorithms (DT, RF, LR, NB, and KNN) are applied to the selected high-ranked and relevant features. ML algorithms' performance was evaluated using different measures: ACC, PRE, REC, and FM. The results showed that RF with the 13 features selected by Extra Trees achieved the highest performance for cross-validation (ACC = 87.6%, PRE = 88.05%, REC = 87.71%, and FM = 87.7%) and testing (ACC = 86.72%, PRE = 87.08%, REC = 86.72%, and FM = 86.73%).

Conflicts of Interest
All authors declare that they have no conflicts of interest.
Figure 3: The best algorithms for 13 features.
Figure 4: The best algorithms for 6 features.