Hypergraph contrastive learning for recommendation with side informationAo, Dun; Cao, Qian; Wang, Xiaofeng
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-06-2024-0266
This paper addresses the limitations of current graph neural network-based recommendation systems, which often neglect the integration of side information and the modeling of complex high-order interactions among nodes. The research motivation stems from the need to enhance recommendation performance by effectively utilizing all available data. We propose a novel method called MSHCN, which leverages hypergraph neural networks to integrate side information and model complex interactions, thereby improving user and item representations.Design/methodology/approachThe MSHCN method employs a hypergraph structure to incorporate various types of side information, including social relationships among users and item attributes, which are essential for enriching user and item representations. The k-means clustering algorithm is utilized to create item-associated hypergraphs, while sentiment analysis on user reviews refines the modeling of user interests. Additionally, hypergraphs are constructed for user-user and item-item interactions based on interaction similarity. MSHCN also incorporates contrastive learning as an auxiliary task to enhance the representation learning process.FindingsExtensive experiments demonstrate that MSHCN significantly outperforms existing recommendation models, particularly in its ability to capture and utilize side information and high-order interactions. This results in superior user and item representations and improved recommendation performance.Originality/valueThe novelty of MSHCN lies in its use of a hypergraph structure to integrate diverse side information and model intricate high-order interactions. The incorporation of contrastive learning as an auxiliary task sets it apart from other hypergraph-based models, providing a significant enhancement in recommendation accuracy.
A novel ensemble causal feature selection approach with mutual information and group fusion strategy for multi-label dataZheng, Yifeng; Zeng, Xianlong; Zhang, Wenjie; Wei, Baoya; Ren, Weishuo; Qing, Depeng
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-04-2024-0144
As intelligent technology advances, practical applications often involve data with multiple labels. Therefore, multi-label feature selection methods have attracted much attention to extract valuable information. However, current methods tend to lack interpretability when evaluating the relationship between different types of variables without considering the potential causal relationship.Design/methodology/approachTo address the above problems, we propose an ensemble causal feature selection method based on mutual information and group fusion strategy (CMIFS) for multi-label data. First, the causal relationship between labels and features is analyzed by local causal structure learning, respectively, to obtain a causal feature set. Second, we eliminate false positive features from the obtained feature set using mutual information to improve the feature subset reliability. Eventually, we employ a group fusion strategy to fuse the obtained feature subsets from multiple data sub-space to enhance the stability of the results.FindingsExperimental comparisons are performed on six datasets to validate that our proposal can enhance the interpretation and robustness of the model compared with other methods in different metrics. Furthermore, the statistical analyses further validate the effectiveness of our approach.Originality/valueThe present study makes a noteworthy contribution to proposing a causal feature selection approach based on mutual information to obtain an approximate optimal feature subset for multi-label data. Additionally, our proposal adopts the group fusion strategy to guarantee the robustness of the obtained feature subset.
Optimization of semi-supervised generative adversarial network models: a surveyMa, Yongqing; Zheng, Yifeng; Zhang, Wenjie; Wei, Baoya; Lin, Ziqiong; Liu, Weiqiang; Li, Zhehan
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-05-2024-0202
With the development of intelligent technology, deep learning has made significant progress and has been widely used in various fields. Deep learning is data-driven, and its training process requires a large amount of data to improve model performance. However, labeled data is expensive and not readily available.Design/methodology/approachTo address the above problem, researchers have integrated semi-supervised and deep learning, using a limited number of labeled data and many unlabeled data to train models. In this paper, Generative Adversarial Networks (GANs) are analyzed as an entry point. Firstly, we discuss the current research on GANs in image super-resolution applications, including supervised, unsupervised, and semi-supervised learning approaches. Secondly, based on semi-supervised learning, different optimization methods are introduced as an example of image classification. Eventually, experimental comparisons and analyses of existing semi-supervised optimization methods based on GANs will be performed.FindingsFollowing the analysis of the selected studies, we summarize the problems that existed during the research process and propose future research directions.Originality/valueThis paper reviews and analyzes research on generative adversarial networks for image super-resolution and classification from various learning approaches. The comparative analysis of experimental results on current semi-supervised GAN optimizations is performed to provide a reference for further research.
X-News dataset for online news categorizationYousafzai, Samia Nawaz; Shahbaz, Hooria; Ali, Armughan; Qamar, Amreen; Nasir, Inzamam Mashood; Tehsin, Sara; Damaševičius, Robertas
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-04-2024-0184
The objective is to develop a more effective model that simplifies and accelerates the news classification process using advanced text mining and deep learning (DL) techniques. A distributed framework utilizing Bidirectional Encoder Representations from Transformers (BERT) was developed to classify news headlines. This approach leverages various text mining and DL techniques on a distributed infrastructure, aiming to offer an alternative to traditional news classification methods.Design/methodology/approachThis study focuses on the classification of distinct types of news by analyzing tweets from various news channels. It addresses the limitations of using benchmark datasets for news classification, which often result in models that are impractical for real-world applications.FindingsThe framework’s effectiveness was evaluated on a newly proposed dataset and two additional benchmark datasets from the Kaggle repository, assessing the performance of each text mining and classification method across these datasets. The results of this study demonstrate that the proposed strategy significantly outperforms other approaches in terms of accuracy and execution time. This indicates that the distributed framework, coupled with the use of BERT for text analysis, provides a robust solution for analyzing large volumes of data efficiently. The findings also highlight the value of the newly released corpus for further research in news classification and emotion classification, suggesting its potential to facilitate advancements in these areas.Originality/valueThis research introduces an innovative distributed framework for news classification that addresses the shortcomings of models trained on benchmark datasets. By utilizing cutting-edge techniques and a novel dataset, the study offers significant improvements in accuracy and processing speed. The release of the corpus represents a valuable contribution to the field, enabling further exploration into news and emotion classification. This work sets a new standard for the analysis of news data, offering practical implications for the development of more effective and efficient news classification systems.
Agent-SwinPyramidNet: an enhanced deep learning model with AMTCF-VMD for anomaly detection in oil and gas pipelinesZhang, Yizhuo; Zhang, Yunfei; Yu, Huiling; Shi, Shen
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-07-2024-0310
The anomaly detection task for oil and gas pipelines based on acoustic signals faces issues such as background noise coverage, lack of effective features, and small sample sizes, resulting in low fault identification accuracy and slow efficiency. The purpose of this paper is to study an accurate and efficient method of pipeline anomaly detection.Design/methodology/approachFirst, to address the impact of background noise on the accuracy of anomaly signals, the adaptive multi-threshold center frequency variational mode decomposition method(AMTCF-VMD) method is used to eliminate strong noise in pipeline signals. Secondly, to address the strong data dependency and loss of local features in the Swin Transformer network, a Hybrid Pyramid ConvNet network with an Agent Attention mechanism is proposed. This compensates for the limitations of CNN’s receptive field and enhances the Swin Transformer’s global contextual feature representation capabilities. Thirdly, to address the sparsity and imbalance of anomaly samples, the SpecAugment and Scaper methods are integrated to enhance the model’s generalization ability.FindingsIn the pipeline anomaly audio and environmental datasets such as ESC-50, the AMTCF-VMD method shows more significant denoising effects compared to wavelet packet decomposition and EMD methods. Additionally, the model achieved 98.7% accuracy on the preprocessed anomaly audio dataset and 99.0% on the ESC-50 dataset.Originality/valueThis paper innovatively proposes and combines the AMTCF-VMD preprocessing method with the Agent-SwinPyramidNet model, addressing noise interference and low accuracy issues in pipeline anomaly detection, and providing strong support for oil and gas pipeline anomaly recognition tasks in high-noise environments.
ConjunctiveNet: an improved deep learning-based conjunctive-eyes segmentation and severity detection modelPahwa, Seema; Kaur, Amandeep; Dhiman, Poonam; Damaševičius, Robertas
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-04-2024-0189
The study aims to enhance the detection and classification of conjunctival eye diseases' severity through the development of ConjunctiveNet, an innovative deep learning framework. This model incorporates advanced preprocessing techniques and utilizes a modified Otsu’s method for improved image segmentation, aiming to improve diagnostic accuracy and efficiency in healthcare settings.Design/methodology/approachConjunctiveNet employs a convolutional neural network (CNN) enhanced through transfer learning. The methodology integrates rescaling, normalization, Gaussian blur filtering and contrast-limited adaptive histogram equalization (CLAHE) for preprocessing. The segmentation employs a novel modified Otsu’s method. The framework’s effectiveness is compared against five pretrained CNN architectures including AlexNet, ResNet-50, ResNet-152, VGG-19 and DenseNet-201.FindingsThe study finds that ConjunctiveNet significantly outperforms existing models in accuracy for detecting various severity stages of conjunctival eye conditions. The model demonstrated superior performance in classifying four distinct severity stages – initial, moderate, high, severe and a healthy stage – offering a reliable tool for enhancing screening and diagnosis processes in ophthalmology.Originality/valueConjunctiveNet represents a significant advancement in the automated diagnosis of eye diseases, particularly conjunctivitis. Its originality lies in the integration of modified Otsu’s method for segmentation and its comprehensive preprocessing approach, which collectively enhance its diagnostic capabilities. This framework offers substantial value to the field by improving the accuracy and efficiency of conjunctival disease severity classification, thus aiding in better healthcare delivery.
MFLD: lightweight object detection with multi-receptive field and long-range dependency in remote sensing imagesWang, Weixing; Chen, Yixia; Lin, Mingwei
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-01-2024-0020
Based on the strong feature representation ability of the convolutional neural network (CNN), generous object detection methods in remote sensing (RS) have been proposed one after another. However, due to the large variation in scale and the omission of relevant relationships between objects, there are still great challenges for object detection in RS. Most object detection methods fail to take the difficulties of detecting small and medium-sized objects and global context into account. Moreover, inference time and lightness are also major pain points in the field of RS.Design/methodology/approachTo alleviate the aforementioned problems, this study proposes a novel method for object detection in RS, which is called lightweight object detection with a multi-receptive field and long-range dependency in RS images (MFLD). The multi-receptive field extraction (MRFE) and long-range dependency information extraction (LDIE) modules are put forward.FindingsTo concentrate on the variability of objects in RS, MRFE effectively expands the receptive field by a combination of atrous separable convolutions with different dilated rates. Considering the shortcomings of CNN in extracting global information, LDIE is designed to capture the relationships between objects. Extensive experiments over public datasets in RS images demonstrate that our MFLD method surpasses the state-of-the-art methods. Most of all, on the NWPU VHR-10 dataset, our MFLD method achieves 94.6% mean average precision with 4.08 M model volume.Originality/valueThis paper proposed a method called lightweight object detection with multi-receptive field and long-range dependency in RS images.
A ranking framework for the selection of IoT cloud platforms using hybrid multi-attribute decision-making methodRaheja, Supriya; Garg, Rakesh; Garg, Ritvik
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-05-2024-0211
The Internet of Things (IoT) cloud platforms provide end-to-end solutions that integrate various capabilities such as application development, device and connectivity management, data storage, data analysis and data visualization. The high use of these platforms results in their huge availability provided by different capabilities. Therefore, choosing the optimal IoT cloud platform to develop IoT applications successfully has become crucial. The key purpose of the present study is to implement a hybrid multi-attribute decision-making approach (MADM) to evaluate and select IoT cloud platforms.Design/methodology/approachThe optimal selection of the IoT cloud platforms seems to be dependent on multiple attributes. Hence, the optimal selection of IoT cloud platforms problem is modeled as a MADM problem, and a hybrid approach named neutrosophic fuzzy set-Euclidean taxicab distance-based approach (NFS-ETDBA) is implemented to solve the same. NFS-ETDBA works on the calculation of assessment score for each alternative, i.e. IoT cloud platforms, by combining two different measures: Euclidean and taxicab distance.FindingsA case study to illustrate the working of the proposed NFS-ETDBA for optimal selection of IoT cloud platforms is given. The results obtained on the basis of calculated assessment scores depict that “Azure IoT suite” is the most preferable IoT cloud platform, whereas “Salesman IoT cloud” is the least preferable.Originality/valueThe proposed NFS-ETDBA methodology for the IoT cloud platform selection is implemented for the first time in this field. ETDBA is highly capable of handling the large number of alternatives and the selection attributes involved in any decision-making process. Further, the use of fuzzy set theory (FST) makes it very easy to handle the impreciseness that may occur during the data collection through a questionnaire from a group of experts.
Consensus model for probabilistic linguistic multi-attribute group decision-making based on incomplete social trust networksKang, Kaiying; Xie, Jialiang; Liu, Xiaohui; Qiu, Jianxiang
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-07-2024-0332
Experts may adjust their assessments through communication and mutual influence, and this dynamic evolution relies on the spread of internal trust relationships. Due to differences in educational backgrounds and knowledge experiences, trust relationships among experts are often incomplete. To address such issues and reduce decision biases, this paper proposes a probabilistic linguistic multi-attribute group decision consensus model based on an incomplete social trust network (InSTN).Design/methodology/approachIn this paper, we first define the new trust propagation operators based on the operations of Probability Language Term Set (PLTS) with algebraic t-conorm and t-norm, which are combined with trust aggregation operators to estimate InSTN. The adjustment coefficients are then determined through trust relations to quantify their impact on expert evaluation. Finally, the particle swarm algorithm (PSO) is used to optimize the expert evaluation to meet the consensus threshold.FindingsThis study demonstrates the feasibility of the method through the selection of treatment plans for complex cases. The proposed consensus model exhibits greater robustness and effectiveness compared to traditional methods, mainly due to the effective regulation of trust relations in the decision-making process, which reduces decision bias and inconsistencies.Originality/valueThis paper introduces a novel probabilistic linguistic multi-attribute swarm decision consensus model based on an InSTN. It proposes a redefined trust propagation and aggregation approach to estimate the InSTN. Moreover, the computational efficiency and decision consensus accuracy of the proposed model are enhanced by using PSO optimization.
A method for recognizing abnormal behaviors of personnel at petroleum stations based on GTB-ResNetYu, Huiling; Dai, Sijia; Shi, Shen; Zhang, Yizhuo
2024 International Journal of Intelligent Computing and Cybernetics
doi: 10.1108/ijicc-07-2024-0317
The abnormal behaviors of staff at petroleum stations pose significant safety hazards. Addressing the challenges of high parameter counts, lengthy training periods and low recognition rates in existing 3D ResNet behavior recognition models, this paper proposes GTB-ResNet, a network designed to detect abnormal behaviors in petroleum station staff.Design/methodology/approachFirstly, to mitigate the issues of excessive parameters and computational complexity in 3D ResNet, a lightweight residual convolution module called the Ghost residual module (GhostNet) is introduced in the feature extraction network. Ghost convolution replaces standard convolution, reducing model parameters while preserving multi-scale feature extraction capabilities. Secondly, to enhance the model's focus on salient features amidst wide surveillance ranges and small target objects, the triplet attention mechanism module is integrated to facilitate spatial and channel information interaction. Lastly, to address the challenge of short time-series features leading to misjudgments in similar actions, a bidirectional gated recurrent network is added to the feature extraction backbone network. This ensures the extraction of key long time-series features, thereby improving feature extraction accuracy.FindingsThe experimental setup encompasses four behavior types: illegal phone answering, smoking, falling (abnormal) and touching the face (normal), comprising a total of 892 videos. Experimental results showcase GTB-ResNet achieving a recognition accuracy of 96.7% with a model parameter count of 4.46 M and a computational complexity of 3.898 G. This represents a 4.4% improvement over 3D ResNet, with reductions of 90.4% in parameters and 61.5% in computational complexity.Originality/valueSpecifically designed for edge devices in oil stations, the 3D ResNet network is tailored for real-time action prediction. To address the challenges posed by the large number of parameters in 3D ResNet networks and the difficulties in deployment on edge devices, a lightweight residual module based on ghost convolution is developed. Additionally, to tackle the issue of low detection accuracy of behaviors amidst the noisy environment of petroleum stations, a triple attention mechanism is introduced during feature extraction to enhance focus on salient features. Moreover, to overcome the potential for misjudgments arising from the similarity of actions, a Bi-GRU model is introduced to enhance the extraction of key long-term features.