Chen Hajaj — AI Research

Chen Hajaj, Associate Professor at Ariel University
Chen Hajaj

Chen Hajaj

|

Ariel University

$1M+
Research Grants
4
Patents
6
Conferences Chaired
50+
Publications
18
h-index
950+
Citations
18
Current Students
29
Alumni

I am an Associate Professor and Head of the Data Science Track at Ariel University — working at the intersection of AI theory and real-world impact. I also serve as Head of the Faculty Review Board and NVIDIA University Ambassador.

Research Translating AI into systems with measurable real-world impact — encrypted traffic analysis, medical decision support, and intelligent recommendation at scale.
Partnerships Close collaborations with the Israel Innovation Authority and the Ministry of Innovation, Science and Technology, bridging academic research and industry deployment.
Mentorship Supervising 18 current students and 29 alumni across PhD and MSc programs, fostering the next generation of AI researchers.

News

Jun 2026
Congratulations to Yarin Amar for being named on the Rector’s List for academic excellence, and to Eylon Yehiel for being awarded the Dean’s List Excellence award. We are proud of your outstanding achievements! 🎓🌟
May 2026
Paper accepted at DCC 2026 (Design, Computing and Cognition), Paris: Learning through Concept Generation: The Case of Academic and Practitioner Teaching in the Design Engineering Studio. 🎨
Apr 2026
Paper accepted at IEEE ICC 2026 in Glasgow: Uncovering Microservice Faults: A Temporal Graph Approach to Root Cause Analysis. 🎉
Mar 2026
Paper accepted at AAMAS 2026: Cleaner Adversarial CAPTCHAs: Intelligent Targets and Precise Noise for Usable Security (Meir Litman & Chen Hajaj). 🔐
Feb 2026
Grant proposal accepted for a research collaboration with CrySyS Lab, Budapest University of Technology. Looking forward to hosting our colleagues soon! 🤝
Feb 2026
New paper accepted in Pediatric Infectious Disease Journal (PIDJ): Machine Learning Tools for Predicting Pediatric Urinary Tract Infections Caused by ESBL-producing Bacteria. 🧬
Jan 2026
New paper published in Computer Networks: Real-Time Network Security: Integrating ANN and Dynamic Graph-Based Clustering — doi:10.1016/j.comnet.2026.112016. 📡
Jan 2026
Two papers presented at IEEE CCNC 2026 in Las Vegas: QoE Prediction for Call of Duty and Cloudy with a Chance of Anomalies (Dynamic GNN for cloud anomaly detection). 🎮🌩️
Dec 2025
New paper in IEEE Access: Survival Forest Models for ICU Mortality Prediction based on Nutrition and Clinical Factors — doi:10.1109/ACCESS.2025.3649629. 🏥
Sep 2025
New paper in IEEE Access: Leveraging OSINT for Advanced Proactive Cybersecurity: Strategies and Solutions — doi:10.1109/ACCESS.2025.3603868. 🕵️

Leadership & Impact

Senior Faculty & Administrative Leadership

NVIDIA University Ambassador (2024–present) — Leading GPU computing and AI education initiatives at Ariel University.

Head of Faculty Review Board (2021–present) — Directing research quality and ethical standards across the Faculty of Engineering.

Head of Data Science Track (2021–present) — Overseeing curriculum development, student programs, and industry partnerships.

Director, Data Science & AI Research Center (2019–2024) — Founded and directed a research center that secured over $1.2M in competitive funding, produced 4 patents, and established national collaborations with the Israel Innovation Authority and the Ministry of Innovation.

Ariel University · 2018–present

Postdoctoral Research

Developed machine learning algorithms for cybersecurity and network traffic analysis. Served on the Postdoc Association Committee, contributing to research policy and mentoring programs.

Vanderbilt University · 2016–2018

Ph.D. in Computer Science

Specialized in game theory and mechanism design, developing theoretical models and algorithms for strategic decision-making in competitive and cooperative environments.

Bar-Ilan University · 2010–2016

Research Focus

Cybersecurity & Network Analysis

Classifying encrypted traffic, detecting anomalies, and building ML-based intrusion detection systems without breaking encryption.

Encrypted TrafficAnomaly DetectionNetwork ML

Medical Data Science

Applying machine learning to clinical data for ICU nutrition optimization, patient outcome prediction, and medical decision support.

Clinical AIICUHealthcare

eCommerce & Recommender Systems

Designing intelligent recommendation engines, pricing strategies, and incentive-aware models that improve user experience and revenue.

Recommender SystemsGame TheoryPersonalization

Design Spaces

Exploring human factors and cognitive dimensions in interface design, virtual environments, and constructivist learning systems.

HCIVR/ARLearning

Human Behavior Analysis

Modeling strategic and social behavior using computational tools — from multi-agent simulations to sentiment and emotion analysis.

Multi-AgentSentimentMechanism Design

Research Grants & Funding

Total Secured: $1M+

Multi-layered Classification of PQC-Encrypted Traffic

2025

Ministry of Innovation, Science and Technology

Co-PIs: Chen Hajaj and Amit Dvir

482900 NIS

Clinical Progression of Dystrophinopathies: Exploring the Role of Autonomic Nervous System Activity and Machine Learning Models

2025

Ministry of Innovation, Science and Technology

Co-PIs: Sharon Barak , Chen Hajaj and Riki Tesler

412569 NIS

Encrypted Networks Traffic Monitoring

2021

Israel Innovation Authority

PI: Chen Hajaj

341000 USD

APK Malware Detection and Analysis

2019

Israel National Cyber Directorate

PI: Chen Hajaj

179000 USD

Quality of Service Monitoring

2019

Israel Innovation Authority

Co-PIs: Chen Hajaj and Amit Dvir

100000 USD

Violence Prediction in Public Events

2019

Israel Innovation Authority

Co-PIs: Chen Hajaj , Amit Dvir and Uzi Ben-Shalom

100000 USD

Strategy Proof Mechanisms for Kidney Exchange

2015

BSF

PI: Chen Hajaj

4000 USD

Join Our Lab

We welcome motivated graduate students passionate about advancing AI research. Our lab offers a collaborative environment, access to cutting-edge projects, strong industry connections, and a track record of high-impact publications.

Collaborative Environment Industry Partnerships Top-tier Publications International Exposure

selected publications

  1. Cleaner Adversarial CAPTCHAs: Intelligent Targets and Precise Noise for Usable Security
    2026
    Meir Litman, and Chen Hajaj
    Proceedings of the 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026)
    Traditional CAPTCHAs are increasingly vulnerable to deep learning-based solvers that decode text and images with high accuracy. In this work, we propose methods to strengthen adversarial CAPTCHAs without compromising human usability. First, we introduce a Precise Gradient Method (PGM) that preserves gradient magnitude (rather than discarding it via a sign operator), producing adversarial perturbations with significantly lower perceptual noise. Second, we develop intelligent target class selection, using either dataset-level confusion structure (Class Relations Network) or image-specific softmax probabilities (Distance-Based Target), to steer adversarial perturbations more efficiently. Across multiple modern architectures (MobileNets, EfficientNets, ResNet, and Vision Transformer), our framework achieves faster convergence (fewer iterations), reduced visual distortion, and notably greater robustness under iterative adversarial retraining. Experiments show that our methods consistently reduce iteration counts and perceptual distortion while significantly increasing the difficulty for automated attacks. Our results offer a practical, scalable path toward the next generation of CAPTCHA systems and contribute new insights to the adversarial machine learning landscape focused on security and usability.
  2. A Classification-by-Retrieval Framework for Few-Shot Anomaly Detection to Detect API Injection
    2025
    Udi Aharon, Ran Dubin, Amit Dvir, and Chen Hajaj
    Computers & Security
    Application Programming Interface (API) Injection attacks refer to the unauthorized or malicious use of APIs, which are often exploited to gain access to sensitive data or manipulate online systems for illicit purposes. Identifying actors that deceitfully utilize an API poses a demanding problem. Although there have been notable advancements and contributions in the field of API security, there remains a significant challenge when dealing with attackers who use novel approaches that don’t match the well-known payloads commonly seen in attacks. Also, attackers may exploit standard functionalities unconventionally and with objectives surpassing their intended boundaries. Thus, API security needs to be more sophisticated and dynamic than ever, with advanced computational intelligence methods, such as machine learning models that can quickly identify and respond to abnormal behavior. In response to these challenges, we propose a novel unsupervised few-shot anomaly detection framework composed of two main parts: First, we train a dedicated generic language model for API based on FastText embedding. Next, we use Approximate Nearest Neighbor search in a classification-by-retrieval approach. Our framework allows for training a fast, lightweight classification model using only a few examples of normal API requests. We evaluated the performance of our framework using the CSIC 2010 and ATRDF 2023 datasets. The results demonstrate that our framework improves API attack detection accuracy compared to the state-of-the-art (SOTA) unsupervised anomaly detection baselines.
  3. The Art of Time-Bending: Data Augmentation and Early Prediction for Efficient Traffic Classification
    2024
    Chen Hajaj, Porat Aharon, Ran Dubin, and Amit Dvir
    Expert Systems with Applications
    Computational efficiency is an important consideration for deploying machine learning models for time series prediction in an online setting. Machine learning algorithms adjust model parameters automatically based on the data, but often require users to set additional parameters, known as hyperparameters. Hyperparameters can significantly impact prediction accuracy. Traffic measurements, typically collected online by sensors, are serially correlated. Moreover, the data distribution may change gradually. A typical adaptation strategy is periodically re-tuning the model hyperparameters, at the cost of computational burden. In this work, we present an efficient and principled online hyperparameter optimization algorithm for Kernel Ridge regression applied to traffic prediction problems. In tests with real traffic measurement data, our approach requires as little as one-seventh of the computation time of other tuning methods, while achieving better or similar prediction accuracy.
  4. Measuring Flight-Destination Similarity: A Multidimensional Approach
    2024
    Anat Goldstein, and Chen Hajaj
    Expert Systems with Applications
    E-tourism websites offer users a vast array of travel destinations and opportunities, necessitating tools that enable destination comparison and intelligent search capabilities. One key requirement for such tools is the ability to measure the similarity between destinations. Over the years, various similarity measurement techniques have been proposed, including user-based and content-based approaches. However, many of these techniques require data preparation or prior domain knowledge from experts. In contrast, this study proposes an innovative approach that requires no prior domain knowledge of flight destinations or their relationships, and utilizes only readily available data. Our approach draws upon concepts from image recognition and natural language processing (NLP) to extract hidden aspects of destinations. Using data from a flight-search website as a testbed, we analyze similarity metrics based on state-of-the-art methods for image recognition, NLP, and product-network analysis. We then compare these metrics to those obtained by human subjects. Our findings suggest that no single method dominates in all aspects, leading us to propose a hybrid method that leverages the strengths of each. The proposed method can be readily applied to measure product similarity in other domains.
  5. Hybrid Speech and Text Analysis Methods for Speaker Change Detection
    2021
    Or Haim Anidjar, Itshak Lapidot, Chen Hajaj, Amit Dvir, and Issachar Gilad
    IEEE/ACM Transactions on Audio, Speech, and Language Processing
    Speaker Change Detection (SCD) is the task of segmenting an input audio-recording according to speaker interchanges. Nowadays, many applications, such as Speaker Diarization (SD) or automatic vocal transcription, depend on this segmentation task. In this paper, we focus on the essential task of the SD problem, the audio segmenting process, and suggest a solution for the SCD problem, as well as the assignment of clustered speaker labels for the extracted segments, and applying the solution over two datasets: a commercial dataset in Hebrew and the ICSI Meeting Corpus. As such, we propose a hybrid framework for the SCD problem that is learned by textual information and speech signals and the meta-data features that can be extracted from them. Moreover, we demonstrate the negative correlation between an increase in the number of speakers in the training dataset and the influence on the overall diarization system’s performance, which is improved using our efficient SCD component. Finally, we show how our proposed hybrid framework remains robust compared to the ICSI Meeting Corpus, as the experimental evaluation’s training and testing is based on two languages.
  6. Improving Robustness of ML Classifiers Against Realizable Evasion Attacks Using Conserved Features
    2019
    Liang Tong, Bo Li, Chen Hajaj, Chaowei Xiao, Ning Zhang, and Yevgeniy Vorobeychik
    28th USENIX Security Symposium (USENIX Security 19)
    Machine learning (ML) techniques are increasingly common in security applications, such as malware and intrusion detection. However, ML models are often susceptible to evasion attacks, in which an adversary makes changes to the input (such as malware) in order to avoid being detected. A conventional approach to evaluate ML robustness to such attacks, as well as to design robust ML, is by considering simplified feature-space models of attacks, where the attacker changes ML features directly to effect evasion, while minimizing or constraining the magnitude of this change. We investigate the effectiveness of this approach to designing robust ML in the face of attacks that can be realized in actual malware (realizable attacks). We demonstrate that in the context of structure-based PDF malware detection, such techniques appear to have limited effectiveness, but they are effective with content-based detectors. In either case, we show that augmenting the feature space models with conserved features (those that cannot be unilaterally modified without compromising malicious functionality) significantly improves performance. Finally, we show that feature space models enable generalized robustness when faced with a variety of realizable attacks, as compared to classifiers which are tuned to be robust to a specific realizable attack.