Encrypted Traffic Classification
AI-powered classification of encrypted network traffic — enabling security monitoring and QoS management without compromising encryption or user privacy.
Over 90% of internet traffic is now encrypted. While encryption protects privacy, it makes traditional network monitoring impossible. We build AI systems that classify and analyze encrypted traffic without decryption — enabling security, quality of service, and compliance while preserving user privacy.
Key Contributions
- PQClass: Post-Quantum Traffic Classification — The first system to classify traffic encrypted with post-quantum algorithms (CRYSTALS-Kyber, NTRU), ensuring network visibility during the quantum cryptography transition.
- GAN-Based Data Synthesis — Generative adversarial networks that produce realistic encrypted traffic datasets, addressing the scarcity of shareable labeled data in this domain.
- Spectral Analysis for Protocol Identification — Frequency-domain analysis of packet inter-arrival times and sizes reveals application fingerprints inside TLS 1.3 and ESNI without any payload inspection.
- Zero-Day Attack Detection — SimCSE-based contrastive learning builds traffic representations that generalize to unseen attack patterns without retraining on labeled attack data.
Impact
Our methods are validated on real-world datasets from commercial ISPs and academic network captures. Applications include real-time threat detection in enterprise networks, QoE optimization for streaming services, and network capacity planning — all with full encryption preserved.
Related Publications
2026
- Quality of Experience Prediction for First Person Shooter Online Gaming: The Case Study of Call of Duty2026Proceedings of the IEEE Consumer Communications & Networking Conference (CCNC 2026)
2025
- Enhancing Encrypted Internet Traffic Classification Through Advanced Data Augmentation Techniques2025Proceedings of the IEEE International Conference on Communications
- PQClass: Classification of Post-Quantum Encryption Applications in Internet Traffic2025Proceedings of the IEEE International Conference on Communications
- ★ A Classification-by-Retrieval Framework for Few-Shot Anomaly Detection to Detect API Injection2025Computers & SecurityApplication Programming Interface (API) Injection attacks refer to the unauthorized or malicious use of APIs, which are often exploited to gain access to sensitive data or manipulate online systems for illicit purposes. Identifying actors that deceitfully utilize an API poses a demanding problem. Although there have been notable advancements and contributions in the field of API security, there remains a significant challenge when dealing with attackers who use novel approaches that don’t match the well-known payloads commonly seen in attacks. Also, attackers may exploit standard functionalities unconventionally and with objectives surpassing their intended boundaries. Thus, API security needs to be more sophisticated and dynamic than ever, with advanced computational intelligence methods, such as machine learning models that can quickly identify and respond to abnormal behavior. In response to these challenges, we propose a novel unsupervised few-shot anomaly detection framework composed of two main parts: First, we train a dedicated generic language model for API based on FastText embedding. Next, we use Approximate Nearest Neighbor search in a classification-by-retrieval approach. Our framework allows for training a fast, lightweight classification model using only a few examples of normal API requests. We evaluated the performance of our framework using the CSIC 2010 and ATRDF 2023 datasets. The results demonstrate that our framework improves API attack detection accuracy compared to the state-of-the-art (SOTA) unsupervised anomaly detection baselines.Abstract DOI
2024
- ★ The Art of Time-Bending: Data Augmentation and Early Prediction for Efficient Traffic Classification2024Expert Systems with ApplicationsComputational efficiency is an important consideration for deploying machine learning models for time series prediction in an online setting. Machine learning algorithms adjust model parameters automatically based on the data, but often require users to set additional parameters, known as hyperparameters. Hyperparameters can significantly impact prediction accuracy. Traffic measurements, typically collected online by sensors, are serially correlated. Moreover, the data distribution may change gradually. A typical adaptation strategy is periodically re-tuning the model hyperparameters, at the cost of computational burden. In this work, we present an efficient and principled online hyperparameter optimization algorithm for Kernel Ridge regression applied to traffic prediction problems. In tests with real traffic measurement data, our approach requires as little as one-seventh of the computation time of other tuning methods, while achieving better or similar prediction accuracy.Abstract DOI
- CBR–Boosting Adaptive Classification By Retrieval of Encrypted Network Traffic with Out-of-Distribution2024arXiv preprint arXiv:2403.11206
- OSF-EIMTC: An Open-Source Framework for Standardized Encrypted Internet Traffic Classification2024Computer Communications
- Hidden in Time, Revealed in Frequency: Spectral Features and Multiresolution Analysis for Encrypted Internet Traffic Classification20242024 IEEE 21st Consumer Communications & Networking Conference (CCNC)
- Revolutionizing Our Way to Better Classifiers: Leveraging Synthetic Data with Generative Models for Encrypted Network Traffic Classification2024Available at SSRN 4654236
2023
- When a RF Beats a CNN and GRU, Together—A Comparison of Deep Learning and Classical Machine Learning Approaches for Encrypted Malware Traffic Classification2023Computers & Security
2022
- MalDIST: From Encrypted Traffic Classification to Malware Traffic Detection and Classification20222022 IEEE 19th annual consumer communications & networking conference (CCNC)
2021
- PCL: Packet Classification with Limited Knowledge2021IEEE INFOCOM 2021-IEEE Conference on Computer Communications
2020
- Robust Machine Learning for Encrypted Traffic Classification2020CoRR