Latest Results The latest content available from Springer
- International Journal of Computational Intelligence Systemsel julio 24, 2024 a las 12:00 am
- A Novel Hierarchical High-Dimensional Unsupervised Active Learning Methodel julio 24, 2024 a las 12:00 am
Abstract This paper processes a novel hierarchical high-dimensional clustering algorithm based on the Active Learning Method (ALM), which is a fuzzy-learning algorithm. The hierarchical part of the algorithm is composed of two phases: divisible and agglomerative. The divisible phase, a zooming-in-process, searches for sub-clusters in already-found clusters hierarchically. At each level of the hierarchy, the clusters are found by an ensemble clustering method based on the density of data. This part of the algorithm blurs each data point as multiple one-dimensional fuzzy membership functions called ink-drop patterns; then, it accumulates the ink-drop patterns of all data points on every dimension separately. Next, it performs one-dimensional density partitioning to produce an ensemble of clustering solutions; after that, combining the results is done based on a novel consensus method with the aid of prime numbers. An agglomerative phase is a bottom-up approach that merges clusters based on a novel distance metric, named \({K}^{2}\) -nearest neighbor. The algorithm is named as the Hierarchical High-Dimensional Unsupervised Active Learning Method (HiDUALM) and is explained in more detail throughout this paper. Although the classical clustering methods are not suitable for high-dimensional data clustering, the proposed method solves the problems related to speed and memory using ensemble learning, while due to its hierarchy and the use of different distance criteria, different levels of the cluster provide the clause. Experiments on synthetic and real-world datasets are presented to show the effectiveness of the proposed-clustering algorithm.
- Online Learning Based on Learning Analytics in Big Data for College English Language Teachingel julio 22, 2024 a las 12:00 am
Abstract This study addresses the challenges of enhancing the quality of education and improving the overall student experience in online English language teaching sessions. Current approaches often struggle with session initiation, real-time data processing, and personalized learning experiences. To tackle these issues, the study proposes a manifold learning data analytics model (MLDAM). This innovative method leverages classifier tree learning to distinguish between trivial and non-trivial aspects of student learning experiences and session data. MLDAM integrates multi-dimensional data extraction, classification learning, and impact evaluation to enhance the effectiveness of online English language teaching. The model analyzes data from 176,009 English phrases across 36 online teaching sessions, focusing on improving session accessibility, student comprehension, and suggestion accuracy. Using an iterative training process based on student performance and feedback, it continuously extracts and processes multiple types of data to refine teaching strategies. Results show notable improvements: a 14.74% increase in classification accuracy, an 8.73% increase in data extraction ratio, an 11.84% reduction in feature discard, a 10.57% decrease in initialization time, and a 13.17% reduction in classification time. These metrics demonstrate MLDAM’s ability to efficiently process and analyze session data, enabling real-time adjustments during lessons. The model optimizes data utilization, making learning more responsive and adaptable. It enhances the precision of impact evaluations, facilitating targeted course adjustments and personalized learning experiences. This research presents a comprehensive, data-driven methodology for improving teaching quality and student outcomes in virtual English classrooms.
- Porosity Prediction Based on Ensemble Learning for Feature Selection and an Optimized GRU Improved by the PSO Algorithmel julio 22, 2024 a las 12:00 am
Abstract Accurate and reliable prediction of porosity forms the foundational basis for evaluating reservoir quality, which is essential for the systematic deployment of oil and gas exploration and development plans. When data quality of samples is low, and critical model parameters are typically determined through subjective experience, resulting in diminished accuracy and reliability of porosity prediction methods utilizing gated recurrent units (GRU), a committee-voting ensemble learning (EL) method, and an enhanced particle swarm optimization (PSO) algorithm are proposed to optimize the GRU-based porosity prediction model. Initially, outliers are eliminated through box plots and the min–max normalization is applied to enhance data quality. To address issues related to model accuracy and high training costs arising from dimensional complexity, substantial noise, and redundant information in logging data, a committee-voting EL strategy based on four feature selection algorithms is introduced. Following data preprocessing, this approach is employed to identify logging parameters highly correlated with porosity, thereby furnishing the most pertinent data samples for the GRU model, mitigating constraints imposed by single-feature selection methods. Second, an improved PSO algorithm is suggested to tackle challenges associated with low convergence accuracy stemming from random population initialization, alongside the absence of global optimal solutions due to overly rapid particle movement during iteration. This algorithm uses a good-point set for population initialization and incorporates a compression factor to devise an adaptive velocity updating strategy, thereby enhancing search efficacy. The enhanced PSO algorithm’s superiority is substantiated through comparison with four alternative swarm intelligent algorithms across 10 benchmark test functions. Ultimately, optimal hyper-parameters for the GRU model are determined using the improved PSO algorithm, thereby minimizing the influence of human factors. Experimental findings based on approximately 15,000 logging data points from well A01 in an operational field validate that, relative to three other deep learning methodologies, the proposed model proficiently extracts spatiotemporal features from logging data, yielding enhanced accuracy in porosity prediction. The mean squared error on the test set was 7.19 × 10–6, the mean absolute error stood at 0.0082, and coefficient of determination reached 0.99, offering novel insights for predicting reservoir porosity.
- Multi-class Breast Cancer Classification Using CNN Features Hybridizationel julio 22, 2024 a las 12:00 am
Abstract Breast cancer has become the leading cause of cancer mortality among women worldwide. The timely diagnosis of such cancer is always in demand among researchers. This research pours light on improving the design of computer-aided detection (CAD) for earlier breast cancer classification. Meanwhile, the design of CAD tools using deep learning is becoming popular and robust in biomedical classification systems. However, deep learning gives inadequate performance when used for multilabel classification problems, especially if the dataset has an uneven distribution of output targets. And this problem is prevalent in publicly available breast cancer datasets. To overcome this, the paper integrates the learning and discrimination ability of multiple convolution neural networks such as VGG16, VGG19, ResNet50, and DenseNet121 architectures for breast cancer classification. Accordingly, the approach of fusion of hybrid deep features (FHDF) is proposed to capture more potential information and attain improved classification performance. This way, the research utilizes digital mammogram images for earlier breast tumor detection. The proposed approach is evaluated on three public breast cancer datasets: mammographic image analysis society (MIAS), curated breast imaging subset of digital database for screening mammography (CBIS-DDSM), and INbreast databases. The attained results are then compared with base convolutional neural networks (CNN) architectures and the late fusion approach. For MIAS, CBIS-DDSM, and INbreast datasets, the proposed FHDF approach provides maximum performance of 98.706%, 97.734%, and 98.834% of accuracy in classifying three classes of breast cancer severities.