International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 8, Issue 3 (May-June 2026) Submit your research before last 3 days of June to publish your research paper in the issue of May-June.

Real Time Network Intrusion Detection with Optimized Inference

Author(s) Mr. Jayanthi Sayi Surya Ryali, Mr. Kaushik Venkata Vasantha Kumar Pentyala, Mr. Himavanth Sai Potturi, Mr. Purna Chandra Reddy Putta, V. Asis Marceline
Country India
Abstract Intrusion Detection Systems (IDS) are essential for protecting modern networked environments against increasingly sophisticated cyber threats. Although machine learning techniques have improved detection capabilities, their deployment in real-time systems is often limited by inference latency and scalability constraints. This paper presents a deep learning-based IDS using a multilayer perceptron (MLP) trained on the NSL-KDD dataset, with emphasis on optimizing inference performance for practical deployment scenarios.
To address deployment challenges, the trained model is exported to ONNX format and evaluated using multiple execution backends, including ONNX Runtime and TensorRT on both CPU and GPU platforms. Additionally, the model is deployed using the Triton Inference Server to assess performance in a production-oriented setting. The proposed pipeline enables a comprehensive comparison of latency and throughput across different inference frameworks. Experimental results show that optimized inference using TensorRT achieves up to 4.8 times reduction in latency compared to the baseline PyTorch implementation, while maintaining comparable classification performance. The findings demonstrate that deployment-aware optimization plays a crucial role in building efficient IDS solutions and provide insights into selecting appropriate inference strategies for low-latency, high-throughput environments.
Keywords Intrusion Detection System (IDS), Deep Learning, Multilayer Perceptron (MLP), NSL-KDD, Model Optimization, ONNX Runtime, TensorRT, Triton Inference Server, Low-Latency Inference, Network Security.
Field Computer > Artificial Intelligence / Simulation / Virtual Reality
Published In Volume 8, Issue 3, May-June 2026
Published On 2026-05-05

Share this