Skip to main navigation Skip to search Skip to main content

Minimalist LLMs for Network Attack Detection Via Quantization and Low-Rank Adaptation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Large Language Models (LLMs) are increasingly considered for network attack detection; however, their high computational footprint and generative design limit their suitability for real-time cybersecurity deployment. This work introduces a lightweight LLM-based framework for network attack detection that utilizes two LLMs, Gemma-2 and LLaMA-3.2, enhanced through quantization and calibration to make them efficient and accurate. The Low-Rank Adaptation (LoRA) framework is used to fine-tune the models for network traffic, improving their capability to understand traffic patterns. LoRA also reduces the number of trainable parameters by approximately 99%. The fine-tuned weights are further compressed to 4-bit float precision using the BitsAndBytes framework, and quantization quality is enhanced using the NormalFloat4 approach, yielding a 3-4 × memory reduction without the need for retraining. Gemma-2 and LLaMA-3.2 achieve accuracies of 0.99 and 0.95, respectively, with LLaMA-3.2 providing a 0.022 s inference time per sample, suitable for real-time monitoring.
Original languageEnglish
Title of host publicationUnknown book
PublisherInstitute of Electrical and Electronics Engineers Inc.
EditionIssue
DOIs
StatePublished - 2025

Fingerprint

Dive into the research topics of 'Minimalist LLMs for Network Attack Detection Via Quantization and Low-Rank Adaptation'. Together they form a unique fingerprint.

Cite this