Publications
Full list of publications by Chen Hajaj — machine learning, cybersecurity, healthcare AI, and multi-agent systems.
2026
- Machine Learning Tools for Predicting Pediatric Urinary Tract Infections Caused by ESBL-Producing Bacteria2026Pediatric Infectious Disease Journal
- Learning through Concept Generation: The Case of Academic and Practitioner Teaching in the Design Engineering Studio2026Proceedings of the International Conference on Design, Computing and Cognition (DCC 2026)
- Uncovering Microservice Faults: A Temporal Graph Approach to Root Cause Analysis2026Proceedings of the IEEE International Conference on Communications. ICC 2026
- Real-Time Network Security: Integrating ANN and Dynamic Graph-Based Clustering2026Computer Networks
- Quality of Experience Prediction for First Person Shooter Online Gaming: The Case Study of Call of Duty2026Proceedings of the IEEE Consumer Communications & Networking Conference (CCNC 2026)
- Cloudy with a Chance of Anomalies: Dynamic Graph Neural Network for Early Detection of Cloud Services’ User Anomalies2026Proceedings of the IEEE Consumer Communications & Networking Conference (CCNC 2026)
- The Sound of Emotions: An Artificial Intelligence Approach to Predicting Emotions from Musical Selections2026Multimedia Systems
- ★ Cleaner Adversarial CAPTCHAs: Intelligent Targets and Precise Noise for Usable Security2026Proceedings of the 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)Traditional CAPTCHAs are increasingly vulnerable to deep learning-based solvers that decode text and images with high accuracy. In this work, we propose methods to strengthen adversarial CAPTCHAs without compromising human usability. First, we introduce a Precise Gradient Method (PGM) that preserves gradient magnitude (rather than discarding it via a sign operator), producing adversarial perturbations with significantly lower perceptual noise. Second, we develop intelligent target class selection, using either dataset-level confusion structure (Class Relations Network) or image-specific softmax probabilities (Distance-Based Target), to steer adversarial perturbations more efficiently. Across multiple modern architectures (MobileNets, EfficientNets, ResNet, and Vision Transformer), our framework achieves faster convergence (fewer iterations), reduced visual distortion, and notably greater robustness under iterative adversarial retraining. Experiments show that our methods consistently reduce iteration counts and perceptual distortion while significantly increasing the difficulty for automated attacks. Our results offer a practical, scalable path toward the next generation of CAPTCHA systems and contribute new insights to the adversarial machine learning landscape focused on security and usability.Abstract
2025
- Survival Forest Models for ICU Mortality Prediction based on Nutrition and Clinical Factors2025IEEE Access
- Cloudy with a Chance of Anomalies: Dynamic Graph Neural Network for Early Detection of Cloud Services’ User Anomalies2025Proceedings of the 34th International Conference on Computer Communications and Networks
- Optimized File Type Detection and One-Shot Reclassification Model2025Proceedings of the IEEE International Conference on Communications
- A New D-MAGIC: Dynamic Model for Cybersecurity Attack Detection Using GNNs into Clustering2025Proceedings of the IEEE International Conference on Communications
- Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques2025Proceedings of the IEEE International Conference on Communications
- PQClass: Classification of Post-Quantum Encryption Applications in Internet Traffic2025Proceedings of the IEEE International Conference on Communications
- Sentiment Analysis of Student-Tutor Interactions in VR Design Crits2025The Association for Computer-Aided Architectural Design Research in Asia (CAADRIA)
- ★ A Classification-by-Retrieval Framework for Few-Shot Anomaly Detection to Detect API Injection2025Computers & SecurityApplication Programming Interface (API) Injection attacks refer to the unauthorized or malicious use of APIs, which are often exploited to gain access to sensitive data or manipulate online systems for illicit purposes. Identifying actors that deceitfully utilize an API poses a demanding problem. Although there have been notable advancements and contributions in the field of API security, there remains a significant challenge when dealing with attackers who use novel approaches that don’t match the well-known payloads commonly seen in attacks. Also, attackers may exploit standard functionalities unconventionally and with objectives surpassing their intended boundaries. Thus, API security needs to be more sophisticated and dynamic than ever, with advanced computational intelligence methods, such as machine learning models that can quickly identify and respond to abnormal behavior. In response to these challenges, we propose a novel unsupervised few-shot anomaly detection framework composed of two main parts: First, we train a dedicated generic language model for API based on FastText embedding. Next, we use Approximate Nearest Neighbor search in a classification-by-retrieval approach. Our framework allows for training a fast, lightweight classification model using only a few examples of normal API requests. We evaluated the performance of our framework using the CSIC 2010 and ATRDF 2023 datasets. The results demonstrate that our framework improves API attack detection accuracy compared to the state-of-the-art (SOTA) unsupervised anomaly detection baselines.Abstract DOI
- Predicting Onset and Progression of Neurodegenerative Diseases Using Blood Test Data and Machine Learning Models2025
- Leveraging OSINT for Advanced Proactive Cybersecurity: Strategies and Solutions2025IEEE Access
- Measuring and Analyzing Defects of Additive Manufactured Ti-6Al-4V Specimens Through Image Segmentation2025Fatigue & Fracture of Engineering Materials & Structures
2024
- Phenotypical Characteristics of Nontuberculous Mycobacterial Infection in Patients with Bronchiectasis2024Respiratory Research
- Warm Recommendation: Enhancing Cold Start Recommendations Using Multimodal Product Representations2024International Conference on Information Systems (ICIS)
- ★ The Art of Time-Bending: Data Augmentation and Early Prediction for Efficient Traffic Classification2024Expert Systems with ApplicationsComputational efficiency is an important consideration for deploying machine learning models for time series prediction in an online setting. Machine learning algorithms adjust model parameters automatically based on the data, but often require users to set additional parameters, known as hyperparameters. Hyperparameters can significantly impact prediction accuracy. Traffic measurements, typically collected online by sensors, are serially correlated. Moreover, the data distribution may change gradually. A typical adaptation strategy is periodically re-tuning the model hyperparameters, at the cost of computational burden. In this work, we present an efficient and principled online hyperparameter optimization algorithm for Kernel Ridge regression applied to traffic prediction problems. In tests with real traffic measurement data, our approach requires as little as one-seventh of the computation time of other tuning methods, while achieving better or similar prediction accuracy.Abstract DOI
- CBR–Boosting Adaptive Classification By Retrieval of Encrypted Network Traffic with Out-of-Distribution2024arXiv preprint arXiv:2403.11206
- Exploring the Role of Sentiment in Tutor-Student Interaction. The Case Study of CS and Architecture Formative Studio Critiques20242024 IEEE Frontiers in Education Conference (FIE)
- Few-Shot API Attack Detection: Overcoming Data Scarcity with GAN-Inspired Learning2024arXiv preprint arXiv:2405.11258
- Extending Limited Datasets with GAN-Like Self-Supervision for SMS Spam Detection2024Computers & Security
- OSF-EIMTC: An Open-Source Framework for Standardized Encrypted Internet Traffic Classification2024Computer Communications
- Hidden in Time, Revealed in Frequency: Spectral Features and Multiresolution Analysis for Encrypted Internet Traffic Classification20242024 IEEE 21st Consumer Communications & Networking Conference (CCNC)
- ★ Measuring Flight-Destination Similarity: A Multidimensional Approach2024Expert Systems with ApplicationsE-tourism websites offer users a vast array of travel destinations and opportunities, necessitating tools that enable destination comparison and intelligent search capabilities. One key requirement for such tools is the ability to measure the similarity between destinations. Over the years, various similarity measurement techniques have been proposed, including user-based and content-based approaches. However, many of these techniques require data preparation or prior domain knowledge from experts. In contrast, this study proposes an innovative approach that requires no prior domain knowledge of flight destinations or their relationships, and utilizes only readily available data. Our approach draws upon concepts from image recognition and natural language processing (NLP) to extract hidden aspects of destinations. Using data from a flight-search website as a testbed, we analyze similarity metrics based on state-of-the-art methods for image recognition, NLP, and product-network analysis. We then compare these metrics to those obtained by human subjects. Our findings suggest that no single method dominates in all aspects, leading us to propose a hybrid method that leverages the strengths of each. The proposed method can be readily applied to measure product similarity in other domains.Abstract DOI
- Revolutionizing Our Way to Better Classifiers: Leveraging Synthetic Data with Generative Models for Encrypted Network Traffic Classification2024Available at SSRN 4654236
2023
- Speech and Multilingual Natural Language Framework for Speaker Change Detection and Diarization2023Expert Systems with Applications
- Online Temporary Learning Groups in Higher Education–Interactions, Compensation, and Maximisation of Achievements in an Israeli Case Study2023Journal of Education Culture and Society
- Detecting Parallel Covert Data Transmission Channels in Video Conferencing Using Machine Learning2023Electronics
- When a RF Beats a CNN and GRU, Together—A Comparison of Deep Learning and Classical Machine Learning Approaches for Encrypted Malware Traffic Classification2023Computers & Security
- Protein Intake and Clinical Outcomes of Enterally Fed Critically Ill Patients2023Clinical Nutrition ESPEN
- Using Machine-Learning to Assess the Prognostic Value of Early Enteral Feeding Intolerance in Critically Ill Patients: A Retrospective Study2023Nutrients
2022
- MaMaDroid2.0–The Holes of Control Flow Graphs2022arXiv preprint arXiv:2202.13922
- Problem-Space Evasion Attacks in the Android OS: A Survey2022arXiv preprint arXiv:2205.14576
- Do You Think You Can Hold Me? The Real Challenge of Problem-Space Evasion Attacks2022arXiv preprint arXiv:2205.04293
- The Hidden Conversion Funnel of Mobile vs. Desktop Consumers2022Electronic Commerce Research and Applications
- MalDIST: From Encrypted Traffic Classification to Malware Traffic Detection and Classification20222022 IEEE 19th annual consumer communications & networking conference (CCNC)
- Using the Cardio-Vascular Index (CVRI) to Predict Mortality in Septic Shock2022ISICEM
2021
- Robust Coordination in Adversarial Social Networks: From Human Behavior to Agent-Based Modeling2021Network Science
- A Thousand Words Are Worth More Than One Recording: Word-Embedding Based Speaker Change Detection2021Interspeech
- ★ Hybrid Speech and Text Analysis Methods for Speaker Change Detection2021IEEE/ACM Transactions on Audio, Speech, and Language ProcessingSpeaker Change Detection (SCD) is the task of segmenting an input audio-recording according to speaker interchanges. Nowadays, many applications, such as Speaker Diarization (SD) or automatic vocal transcription, depend on this segmentation task. In this paper, we focus on the essential task of the SD problem, the audio segmenting process, and suggest a solution for the SCD problem, as well as the assignment of clustered speaker labels for the extracted segments, and applying the solution over two datasets: a commercial dataset in Hebrew and the ICSI Meeting Corpus. As such, we propose a hybrid framework for the SCD problem that is learned by textual information and speech signals and the meta-data features that can be extracted from them. Moreover, we demonstrate the negative correlation between an increase in the number of speakers in the training dataset and the influence on the overall diarization system’s performance, which is improved using our efficient SCD component. Finally, we show how our proposed hybrid framework remains robust compared to the ICSI Meeting Corpus, as the experimental evaluation’s training and testing is based on two languages.Abstract DOI
- PCL: Packet Classification with Limited Knowledge2021IEEE INFOCOM 2021-IEEE Conference on Computer Communications
- Feeding Intolerance as a Predictor of Clinical Outcomes in Critically Ill Patients: A Machine Learning Approach2021Clinical Nutrition ESPEN
- Using machine learning to support early prediction of feeding intolerance in critically ill patients2021ESICM LIVES
2020
- Robust Machine Learning for Encrypted Traffic Classification2020CoRR
- Evasion Is Not Enough: A Case Study of Android Malware2020International symposium on cyber security cryptography and machine learning
- Robust Malicious Domain Detection2020Cyber Security Cryptography and Machine Learning: Fourth International Symposium, CSCML 2020, Be’er Sheva, Israel, July 2–3, 2020, Proceedings 4
2019
- Adversarial Coordination on Social Networks2019Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems
- ★ Improving Robustness of ML Classifiers Against Realizable Evasion Attacks Using Conserved Features201928th USENIX Security Symposium (USENIX Security 19)Machine learning (ML) techniques are increasingly common in security applications, such as malware and intrusion detection. However, ML models are often susceptible to evasion attacks, in which an adversary makes changes to the input (such as malware) in order to avoid being detected. A conventional approach to evaluate ML robustness to such attacks, as well as to design robust ML, is by considering simplified feature-space models of attacks, where the attacker changes ML features directly to effect evasion, while minimizing or constraining the magnitude of this change. We investigate the effectiveness of this approach to designing robust ML in the face of attacks that can be realized in actual malware (realizable attacks). We demonstrate that in the context of structure-based PDF malware detection, such techniques appear to have limited effectiveness, but they are effective with content-based detectors. In either case, we show that augmenting the feature space models with conserved features (those that cannot be unilaterally modified without compromising malicious functionality) significantly improves performance. Finally, we show that feature space models enable generalized robustness when faced with a variety of realizable attacks, as compared to classifiers which are tuned to be robust to a specific realizable attack.Abstract
2018
- A crowdsourcing framework for medical data sets2018AMIA Summits on Translational Science Proceedings
2017
- Non-Cooperative Team Formation and a Team Formation Mechanism2017Available at SSRN 3054771
- Enhancing Comparison Shopping Agents Through Ordering and Gradual Information Disclosure2017Autonomous Agents and Multi-Agent Systems
- Selective Opportunity Disclosure at the Service of Strategic Information Platforms2017Autonomous Agents and Multi-Agent Systems
- Enhancing Crowdworkers’ Vigilance2017Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17)
- Agent-Mediated Electronic Commerce. Designing Trading Strategies and Mechanisms for Electronic Markets2017
2016
- Extending Workers’ Attention Span Through Dummy Events2016Proceedings of the AAAI Conference on Human Computation and Crowdsourcing
- Intelligent Mechanisms for Platforms in Modern Markets2016
2015
- Improving Comparison Shopping Agents’ Competence Through Selective Price Disclosure2015Electronic Commerce Research and Applications
- Automatic Anatomical Shape Correspondence and Alignment Using Mesh Features2015Biomedical Imaging (ISBI), 2015
- Strategy-Proof and Efficient Kidney Exchange Using a Credit Mechanism2015In proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence
2014
- Advanced Service Schemes for a Self-Interested Information Platform2014Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems
- Ordering Effects and Belief Adjustment in the Use of Comparison Shopping Agents2014Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2014)
- Strategic Information Platforms: Selective Disclosure and the Price of Free2014Proceedings of the Fifteenth ACM conference on Economics and Computation
2013
- Search More, Disclose Less2013In proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence
2012
- Three Dimensional Group Registration of Mesh Objects2012