TraceCodec: A Compiler-Backed Neural Codec for Stateful Multi-Flow Network Traffic Traces
Authors: Junhui Ding, Xinchen Zhang, Xiaohui Xie, Shinan Liu
First: 2026-05-28T13:52:40+00:00 · Latest: 2026-05-28T13:52:40+00:00
Abstract
Critical networking workflows require high-fidelity packet captures (PCAPs) for testing, security analysis, and protocol validation, not just statistical flow-level summaries. Recent packet generators have demonstrated protocol-constrained PCAP synthesis, but they universally decode directly to raw packet fields. That interface entangles learned behavioral choices with deterministic protocol consequences, which forces packet realization to depend on post-hoc heuristic repair. We identify this decode interface as the fundamental bottleneck and present TraceCodec, a state-aware neural codec for stateful multi-flow traces. TraceCodec lifts each packet into a timed packet action with explicit flow slots and transport cues, then learns a continuous per-packet latent. A deterministic compiler lowers decoded actions back to PCAPs, owning endpoint assignment, TCP state, legality constraints, and packet rendering. The latent layer exposes a generator-facing sequence space, so downstream traffic models can operate on packet-action latents rather than raw header fields. On CICIDS2017 Monday, TraceCodec matches packet count, protocol composition, and flow population to within 0.03%. Raw-field baselines under the same non-repair policy distort flow counts and TCP state by orders of magnitude. Structural diagnostics show that TraceCodec preserves TCP state transitions and multi-flow interleaving that raw-field decoders fragment. This work establishes a new foundation for high-fidelity packet-trace generation.
Summary / 总结
Critical networking workflows require high-fidelity packet captures (PCAPs) for testing, security analysis, and protocol validation, not just statistical flow-level summaries.
A Comprehensive Protocol Stack for Quantum Networks with a Global Entanglement Module
Authors: Xiaojie Fan, C. R. Ramakrishnan, Himanshu Gupta
First: 2025-09-20T21:39:25+00:00 · Latest: 2026-05-28T01:41:34+00:00
Abstract
The development of large-scale quantum networks requires not only advances in physical-layer technologies but also a comprehensive protocol stack that integrates communication, control, and resource management across all layers. We present the first such protocol stack, which introduces a Global Entanglement Module (GEM) that maintains a consistent, network-wide view of entanglement resources through distributed synchronization strategies. By enabling real-time adaptive execution of entanglement distribution plans, GEM bridges the gap between static planning and dynamic operation. The stack naturally supports pre-distributed entanglement, purification, and multi-partite state generation, making it applicable to a broad range of quantum networking applications. We design and evaluate multiple adaptive heuristics for real-time execution and show that a lightweight scoring-based strategy consistently achieves the best performance, improving entanglement generation rates by about 20% over a globally optimal but non-adaptive fixed-tree baseline and achieving more than a two-fold improvement relative to recent connectionless approaches. Across all scenarios-including predistribution and fidelity analysis-GEM consistently enables lower latency and robust operation. These results establish a practical pathway toward scalable, adaptive quantum internet systems.
Summary / 总结
The development of large-scale quantum networks requires not only advances in physical-layer technologies but also a comprehensive protocol stack that integrates communication, control, and resource management across all layers.
Dyna-5G: Dynamic Role Switching for Self-Organizing 5G M2M Networks
Authors: Evangelos Bitsikas, Adam Belfki, Aanjhan Ranganathan
First: 2024-06-21T23:11:45+00:00 · Latest: 2026-05-27T19:53:50+00:00
Abstract
M2M deployments such as drone swarms demand mission-critical communication: km-scale range, strong per-device identity and mutual authentication, and deterministic QoS for bandwidth-intensive payloads. Cellular 5G uniquely satisfies all of these, yet it has seen limited adoption in autonomous fleets. The barrier is not capability but resilience: today's 5G networks assume fixed infrastructure, and when the base station fails, recovery is uniquely complex. Unlike simpler wireless protocols where devices can transparently switch nodes, 5G failure requires reconstructing distributed state such as authentication contexts, QoS bindings, tunnels, and RRC state machines across the fleet, a process that no existing system automates. We present Dyna-5G, which makes this happen. Dyna-5G is the first 5G Standalone-compliant framework for dynamic role switching in M2M fleets, where any device can assume the role of 5G Core, RAN, or UE at runtime. It orchestrates failure detection, leader selection, and coordinated state teardown and re-establishment, all without modifying 3GPP protocols. We evaluate Dyna-5G on a high-fidelity software emulation testbed, with Open5GS and srsRAN, across hundreds of trials with up to 10 drones. Control-plane overhead averages 0.47 Mb/s (approximately 0.47% of a 100 Mb/s bearer), while failure recovery completes in about 2.5 s, of which approximately 86% is due to stack-dependent cellular procedures. Dyna-5G's orchestration logic itself adds only about 175 ms per reconfiguring role. All tested missions complete successfully, even under injected leader crashes.
Summary / 总结
M2M deployments such as drone swarms demand mission-critical communication: km-scale range, strong per-device identity and mutual authentication, and deterministic QoS for bandwidth-intensive payloads.
OpenURMA: A Clean-Room Open Implementation of the Unified Bus Protocol
Authors: Bojie Li
First: 2026-05-27T16:38:57+00:00 · Latest: 2026-05-27T16:38:57+00:00
Abstract
Modern datacenter RDMA is bottlenecked at the network interface, not the wire. A NIC running RoCE or InfiniBand holds per-connection state for every (application, remote-endpoint) pair - hundreds of megabytes at 1024-application fanout - and pays a four-traversal PCIe round trip on a 64-byte operation, inflating latency an order of magnitude beyond the wire. Both follow from the Queue Pair over PCIe abstraction RDMA inherits from InfiniBand.
Huawei's Unified Bus (UB), a public 2025 specification, changes the abstraction: it decouples per-application endpoint state from per-host transport state so connection context grows additively, exposes ordering as opt-in, and reaches remote memory through native CPU load/store to an on-chip-bus controller. UB ships in Huawei's closed Ascend 950 silicon.
OpenURMA is the first clean-room open implementation of UB's transport and transaction layers, realised at three tiers - synthesisable RTL on Alveo U50, a cycle-level two-node SystemC simulator, and a gem5 full-system scaffold - each with a matched OpenRoCE (RoCEv2 RC) baseline. The contribution is the implementation, harness, and controlled comparison closed silicon does not admit. On the canonical 64-byte remote fetch - LOAD on UB-spec Sec.8.3, READ on RoCEv2 RC - UB's load/store path delivers ~500 ns end-to-end, 4.37x below the matched baseline (2186 ns), sustains 2.80x higher throughput, and fits in ~14% of a U50's LUTs.
Summary / 总结
Modern datacenter RDMA is bottlenecked at the network interface, not the wire.
Scaling Multi-agent Systems: A Smart Middleware for Improving Agent Interactions
Authors: Charles Fleming, Guillaume De Saint Marc, Ramana Kompella, Peter Bosch, Vijoy Pandey
First: 2026-04-03T19:58:01+00:00 · Latest: 2026-05-27T15:58:08+00:00
Abstract
As Large Language Model (LLM) based Multi-Agent Systems (MAS) evolve from experimental pilots to complex, persistent ecosystems, the limitations of direct agent-to-agent communication have become increasingly apparent. Current architectures suffer from fragmented context, stochastic hallucinations, rigid security boundaries, and inefficient topology management. This paper introduces Cognitive Fabric Nodes (CFN), a novel middleware layer that creates an omnipresent "Cognitive Fabric" between agents. Unlike traditional message queues or service meshes, CFNs are not merely pass-through mechanisms; they are active, intelligent intermediaries.
Central to this architecture is the elevation of Memory from simple storage to an active functional substrate that informs four other critical capabilities: Topology Selection, Semantic Grounding, Security Policy Enforcement, and Prompt Transformation. We propose that each of these functions be governed by learning modules utilizing Reinforcement Learning (RL) and optimization algorithms to improve system performance dynamically. By intercepting, analyzing, and rewriting inter-agent communication, the Cognitive Fabric ensures that individual agents remain lightweight while the ecosystem achieves coherence, safety, and semantic alignment.
We evaluate the effectiveness of the CFN on the HotPotQA and MuSiQue datasets in a multi-agent environment and demonstrate that the CFN improves performance by more than 10\% on both datasets over direct agent to agent communication.
Summary / 总结
As Large Language Model (LLM) based Multi-Agent Systems (MAS) evolve from experimental pilots to complex, persistent ecosystems, the limitations of direct agent-to-agent communication have become increasingly apparent.
Efficient and Quantum-safe Internet Key Exchange Protocols for Satellite Communications
Authors: Davide De Zuane, Marco Baldi, Paolo Santini, Grégoire Anchelergues, Daniele Romano, Alessandro Cammarano, Juan José Grosso
First: 2026-05-27T15:58:04+00:00 · Latest: 2026-05-27T15:58:04+00:00
Comments: 6 pages, accepted for presentation at IEEE LANMAN 2026
Abstract
This paper studies cryptographic key exchange in satellite communications, which requires specific solutions because the satellite context presents unique challenges, particularly concerning onboard resource constraints and long transmission latency. We address these challenges by considering the Internet Key Exchange (IKE) protocol, which is widely used in terrestrial networks, and studying its applicability in the satellite context. This requires addressing two main issues: i) its efficiency in terms of the resources and bandwidth required to adapt to satellite terminals, and ii) its resistance even to attackers equipped with a quantum computer, in order to resist obsolescence and defend against harvest-now-decrypt-later attacks. We study these aspects from both a design and experimental point of view, defining and assessing some protocol variants characterized by low complexity and quantum resistance. To address the need to manage the transition from classic cryptographic primitives to post-quantum ones, we also consider the possibility of using hybrid cryptographic solutions that combine them both.
Summary / 总结
This paper studies cryptographic key exchange in satellite communications, which requires specific solutions because the satellite context presents unique challenges, particularly concerning onboard resource constraints and long transmission latency.
Automated Heuristic Design for Network Operations
Authors: Reza Namvar, José Gallego, Jose A. Ayala-Romero, Livia Elena Chatzieleftheriou, Andres Garcia-Saavedra, Albert Banchs, Marco Fiore
First: 2026-05-27T09:20:51+00:00 · Latest: 2026-05-27T09:20:51+00:00
Abstract
Network operation relies on heuristics to solve many tasks rapidly and efficiently across the protocol stack. These heuristics are the result of thorough human-driven design rooted in expert knowledge of the target system and problem. Recently, approaches powered by Artificial Intelligence have shown promising results in devising solutions that outperform long-established heuristics in classical problems. We explore the possibility of applying such Automated Heuristic Design (AHD) frameworks to network environments by (i) discussing the general integration of AHD with network operation and the associated challenges, as well as (ii) proposing a practical implementation of AHD for a specific networking task, i.e., 5G decoding. Initial results show how modern AHD tools can devise heuristics for Low-Density Parity Check decoding on par with state-of-the-art solutions implemented in production systems.
Summary / 总结
Network operation relies on heuristics to solve many tasks rapidly and efficiently across the protocol stack.
Kernel-Level Per-Slice UPF Latency Measurement in Containerised 5G Core Networks
Authors: Akhil Dev Mishra, Mayank Pandey
First: 2026-05-27T09:08:43+00:00 · Latest: 2026-05-27T09:08:43+00:00
Comments: 4 pages, 3 figures, dataset and code at https://github.com/MP-Akhil-5G/open5gs-slice-measurement
Abstract
The 5G Core User Plane Function is responsible for packet forwarding, GTP-U decapsulation, and quality of service enforcement for every user data session. How the UPF behaves under simultaneous multi-slice workloads remains empirically uncharacterised in the open literature. Specifically, how its forwarding latency responds to load, how well it isolates one slice from another, and what timing budgets remain available for intelligent control are all open questions. This paper presents a measurement study conducted on a containerised open5GS deployment with three concurrent network slices. We design and implement a namespace-aware TC-BPF instrumentation framework that resolves the fundamental obstacle preventing existing tools from attributing latency observations to individual containerised network functions. We deploy eMBB, URLLC, and mMTC slices with realistic application traffic under light, medium, and heavy load conditions and collect approximately 28 million matched N3 to N6 forwarding delay pairs. The gathered results reveal that eMBB forwarding delay is load-sensitive with the 99th percentile growing from 574 to 1,243 microseconds across load conditions. URLLC delay is load-insensitive, confirming per-UPF process isolation. mMTC exhibits wide-tail TCP behaviour. On this platform, N4 PFCP session modification latency remains consistently below 200 microseconds regardless of data-plane load, suggesting substantial timing headroom within the two-millisecond budget assumed by AI-driven UPF orchestration designs. The instrumentation framework, experiment scripts, and dataset schema are released at https://github.com/MP-Akhil-5G/open5gs-slice-measurement.
Summary / 总结
The 5G Core User Plane Function is responsible for packet forwarding, GTP-U decapsulation, and quality of service enforcement for every user data session.
A Preliminary Assessment of Midhaul Links at 140 GHz using Ray-Tracing
Authors: Sravan Reddy Chintareddy, Marco Mezzavilla, Sundeep Rangan, Morteza Hashemi
First: 2026-05-26T23:42:53+00:00 · Latest: 2026-05-26T23:42:53+00:00
Abstract
The ever-growing demand for mobile data necessitates a transport network architecture that can withstand the 5G-and-beyond multi-Gbps traffic requirements. To cater for such unprecedented demand, studies are being conducted to incorporate TeraHertz (THz) communications in future mobile networks. In this paper, we consider an urban environment and evaluate the feasibility of THz wireless midhaul links for the transport networks between the Central Units (CU) and Distributed Units (DU) in a disaggregated 5G network architecture with functional splits. Our goal is to study the feasibility of midhaul links at 140 GHz by minimizing the number of required CUs to serve all the DUs. To this end, we define several policies for selecting CU and DU nodes in order to determine the peak data rate that can be supported over each link between a CU and DU. Our numerical results based on ray-tracing suggest that wireless links at 140 GHz with 3GPP option 2 as High Layer Split (HLS) represents a promising technology for midhaul transport networks.
Summary / 总结
The ever-growing demand for mobile data necessitates a transport network architecture that can withstand the 5G-and-beyond multi-Gbps traffic requirements.
RouteProfile: Graph-Based Profiling for Cold-Start LLM Routing
Authors: Jingjun Xu, Hongji Pu, Tao Feng, Haozhen Zhang, Jiaxuan You, Ge Liu
First: 2026-04-30T19:56:08+00:00 · Latest: 2026-05-26T23:15:01+00:00
Abstract
LLM routing is increasingly important for selecting suitable models under diverse user needs and deployment constraints, but its practical effectiveness depends on continual adaptation to emerging queries and newly released models. New-LLM integration is particularly challenging, as newly released models lack the query-response-reward interactions required for router training and cannot be profiled as directly as new queries via semantic embeddings. Existing profiles are limited: LLM-generated descriptions are often coarse, while interaction-based embeddings are costly to construct. To address this problem, we propose RouteProfile, a graph-based profiling framework that constructs LLM profiles from public signals in technical reports or model cards, including model family, model description, reported benchmark scores, and benchmark domains. RouteProfile organizes these heterogeneous signals into a graph and studies profile construction along four dimensions: organizational form, representation type, aggregation depth, and learning configuration. We evaluate RouteProfile in training-free cold-start routing and new-LLM integration settings. Experiments show that: (1) structured profiles outperform flat baselines in training-free cold-start routing; (2) model family metadata is more reliable than benchmark domain information; and (3) effective new-LLM integration requires profile-router co-design. Overall, our findings highlight the importance of profile design for enabling routing systems to adapt to the evolving model ecosystem.
Summary / 总结
LLM routing is increasingly important for selecting suitable models under diverse user needs and deployment constraints, but its practical effectiveness depends on continual adaptation to emerging queries and newly released models.
Characterizing the Configuration of Starlink Queuing
Authors: Johan Garcia, Simon Sundberg, Anna Brunstrom
First: 2026-05-26T21:43:56+00:00 · Latest: 2026-05-26T21:43:56+00:00
Comments: This is an author-supplied definitive version of a paper accepted at IMC'26 cycle 1 on 2nd Feb 2026. First submitted to IMC'25 in 15th May 2025 and accepted with one-shot-revision on 15th Aug 2025. Please cite with ACM reference format once the ACM DOI is active. 10 pages, 7 figures
Abstract
In all networking systems, queuing is important to ensure appropriate resource utilization in the presence of bursty traffic and varying traffic demands. The Starlink access network is additionally also dynamic in terms of the capacity it can provide, and thus queuing plays an even greater role to ensure appropriate communication performance for the end-users while maintaining high resource utilization. However, for Starlink most system design details, along with the setup of the internal queuing, is private information and not publicly available. To address this we have developed a high-precision, burst-pattern controlled, traffic generation approach allowing us to precisely measure the one-way delay for Starlink. By analyzing the delay and loss in conjunction with a queue simulator we find that Starlink does not employ per-flow fair queuing or drop-tail buffers, but it does use drop-front buffer management. While drop-front reduces delay, it may also interfere with the assumptions made by loss-based congestion controls, potentially contributing to throughput degradation.
Summary / 总结
In all networking systems, queuing is important to ensure appropriate resource utilization in the presence of bursty traffic and varying traffic demands.
GENESIS: Harnessing AI Agents for Autonomous 6G RAN Synthesis, Research, and Testing
Authors: Tamerlan Aghayev, Maxime Elkael, Michele Polese, Minh Dat Nguyen, Gabriele Gemmi, Andrea Lacava, Ali Saeizadeh, Reshma Prasad, Paolo Testolina, Angelo Feraudo, Soumendra Nanda, Pedram Johari, Salvatore D'Oro, Tommaso Melodia
First: 2026-05-26T17:58:43+00:00 · Latest: 2026-05-26T17:58:43+00:00
Comments: 18 pages, 16 figures
Abstract
Cellular research and development (R&D) is throttled by six structural processes that each consume months of manual engineering work per iteration: (i) synthesizing new features from standards or research papers into production code; (ii) conformance and interoperability testing; (iii) hardening against field anomalies and diverse deployment environments; (iv) data-driven optimization of network functionalities; (v) discovering and prototyping novel waveforms, functionalities, and capabilities for future standards; and (vi) securing the stack against vulnerabilities. Although Large Language Models (LLMs) have compressed comparable R&D work in general software engineering from days to minutes, their known pitfalls worsen on Radio Access Network (RAN) use cases: they hallucinate Application Programming Interfaces (APIs) and mis-read specifications, which kills interoperability of RAN components at the first mistake, and they heavily rely on simulations for designing algorithms, which is notorious for breaking when transferred to real hardware. To address these challenges, we present GENESIS, an agentic Artificial Intelligence (AI) framework that converts intents (e.g., a specification clause, a telemetry anomaly, or a research hypothesis) into solutions validated with over-the-air experiments, fed back into a persistent knowledge base. GENESIS is built on three composable primitives (agents, skills, hooks) and a knowledge layer (SYNAPSE) that doubles as the source of ground truth and the recipient of every artifact the framework produces, making capabilities compound across runs.
Summary / 总结
Cellular research and development (R&D) is throttled by six structural processes that each consume months of manual engineering work per iteration: (i) synthesizing new features from standards or research papers into production code; (ii) conformance and interoperability testing; (iii) hardening against field anomalies and diverse deployment environments; (iv) data-driven optimization of network functionalities; (v) discovering and prototyping novel waveforms, functionalities, and capabilities for future standards; and (vi) securing the stack against vulnerabilities.
Latency in Real-Time 3D Volumetric Streaming: A Comprehensive Study
Authors: Seungwoo Hong, Hosun Yoon, Seong Moon, Inayat Ali
First: 2026-05-21T08:05:45+00:00 · Latest: 2026-05-26T02:39:59+00:00
Comments: 6 pages, 11 figures
Abstract
Real-time 3D volumetric streaming is a transformative technology that enables the seamless transmission and rendering of high-fidelity 3D models, enhancing applications in virtual reality (VR), augmented reality (AR), gaming, telepresence, and remote collaboration. However, latency remains a major challenge, affecting immersion, causing motion sickness, and disrupting real-time interactions. Addressing these latency issues is essential for improving user experience and ensuring system efficiency. This study conducts a comprehensive latency measurement and analysis within a real-time volumetric streaming environment. We systematically break down the streaming process into three key layers: the application layer, the transport protocol layer, and the network layer. By evaluating each layer in a real-world system, we identify latency bottlenecks, quantify their impact, and uncover the underlying causes of delay. Based on these findings, we propose targeted optimization strategies to mitigate latency and enhance system responsiveness. Through this research, we establish best practices and innovative solutions to improve the efficiency, scalability, and overall user experience of real-time 3D volumetric streaming. Our insights contribute to advancing the field, paving the way for more immersive and responsive digital environments.
Summary / 总结
Real-time 3D volumetric streaming is a transformative technology that enables the seamless transmission and rendering of high-fidelity 3D models, enhancing applications in virtual reality (VR), augmented reality (AR), gaming, telepresence, and remote collaboration.
Beyond Traffic Matrix: DELTA -- A DAG-Aware OCS Logical Topology Optimization for AIDCs
Authors: Niangen Ye, Jingya Liu, Guofu Zhu, Weiqiang Sun, Weisheng Hu
First: 2026-03-30T06:54:48+00:00 · Latest: 2026-05-26T00:48:38+00:00
Abstract
The rapid scaling of large language models (LLMs) exacerbates communication bottlenecks in AI data centers (AIDCs). To overcome this, optical circuit switches (OCS) are increasingly adopted for their superior bandwidth capacity and energy efficiency. However, their reconfiguration overhead precludes intra-iteration topology update, necessitating a priori engineering of a static topology to absorb time-varying LLM traffic. Existing methods engineer these topologies based on traffic matrices. However, this representation obscures the bursty concurrent bandwidth demands dictated by parallelization strategies and fails to account for the independent channels required for concurrent communication.
To address this, we propose DELTA, an efficient logical topology optimization framework for AIDCs that leverages the computation-communication directed acyclic graph (DAG) to encode time-varying traffic patterns into a Mixed-Integer Linear Programming (MILP) model, while exploiting the temporal slack of non-critical tasks to save optical ports without penalizing iteration makespan. By pioneering a variable-length time interval formulation, DELTA significantly reduces the solution space compared to the fixed-time-step formulation. To scale to thousand-GPU clusters, we design a dual-track acceleration strategy that combines search space pruning (reducing complexity from quadratic to linear) with heuristic hot-starting. Evaluations on large-scale LLM workloads show that DELTA reduces communication time by up to 17.5% compared to state-of-the-art traffic-matrix-based baselines. Furthermore, the framework reduces optical port consumption by at least 20%; dynamically reallocating these surplus ports to bandwidth-bottlenecked workloads reduces their performance gap relative to ideal non-blocking electrical networks by up to 26.1%, ultimately enabling most workloads to achieve near-ideal performance.
Summary / 总结
The rapid scaling of large language models (LLMs) exacerbates communication bottlenecks in AI data centers (AIDCs).
Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live
Authors: Hanchen Li, Runyuan He, Qiuyang Mang, Qizheng Zhang, Huanzhi Mao, Xiaokun Chen, Hangrui Zhou, Alvin Cheung, Joseph Gonzalez, Ion Stoica
First: 2025-11-04T03:43:05+00:00 · Latest: 2026-05-25T23:34:23+00:00
Abstract
KV cache management is essential for efficient LLM inference. To maximize utilization, existing inference engines evict finished requests' KV cache if new requests are waiting. This policy breaks for agentic workloads, which interleave LLM calls with tools, introducing pauses that prevent effective KV reuse across turns. Since many tool calls have much shorter durations than human response multi-turn chatbot, it would be promising to retain the KV cache in during these tools. However, many challenges remain. First, we need to consider both the potential cost of recomputation or reloading (if offloading enabled) as well as the increasing queueing delays after eviction from GPU. Second, due to the internal variance of tool call durations, the method needs to remain robust under limited predictability of tool call durations.
We present Continuum, a serving system to optimize job completion time for multi-turn agent workloads by introducing time-to-live mechanism for KV cache retention. For requests that generate tool calls, Continuum selectively pins the KV cache in GPU memory with a time-to-live value determined by the reload cost and potential queueing delay induced by eviction. When the TTL expires, the KV cache can be automatically evicted to free up GPU memory, providing robust performance under edge cases. When combined with program-level first-come-first-serve, Continuum preserves multi-turn continuity, and reduces delay for agentic workflows. Evaluations on real-world agents (SWE-Bench, BFCL, OpenHand) with Llama-3.1 8B/70B, Gemma-3 12B, and GLM-4.5 355B shows that Continuum improves the average job completion times by over 8x while improving throughput.
Summary / 总结
KV cache management is essential for efficient LLM inference.
Reexamining Paradigms of End-to-End Data Movement
Authors: Chin Fang, Timothy Stitt, Michael J. McManus, Toshio Moriya
First: 2025-12-17T02:38:06+00:00 · Latest: 2026-05-25T20:03:47+00:00
Comments: 33 pages and 15 figures
Abstract
The pursuit of high-performance data transfer often focuses on raw network bandwidth. International links of 100 Gbps or higher are frequently considered the primary enabler. While necessary, this network-centric view is incomplete. It equates provisioned link speeds with practical, sustainable data movement capabilities. It is a common observation that lower-than-desired data rates manifest even on 10 Gbps links, with higher-speed networks only amplifying their visibility. We investigate six paradigms -- from network latency and TCP congestion control to host-side factors such as CPU performance and virtualization -- that critically impact data movement workflows. These paradigms represent widely accepted engineering assumptions that inform system design, procurement decisions, and operational practices in production data movement environments. We introduce the Drainage Basin Pattern conceptual model for reasoning about end-to-end data flow constraints across heterogeneous hardware and software components at varying desired data rates to address the fidelity gap between raw bandwidth and application-level throughput. Our findings are validated through rigorous production-scale deployments, from 10 Gbps links to U.S. DOE ESnet technical evaluations and transcontinental production trials over 100 Gbps operational links. The results demonstrate that principal bottlenecks often reside outside the network core, and that a holistic hardware-software co-design enables consistent, predictable performance for demanding data transports (bulk and streaming).
The key goal is to transform a demanding data transfer from a struggle with unknown outcomes into a predictable, guaranteed line-rate, routine operation that anyone can do. Another goal is to rectify the general misconception that conflates complexity with expertise.
Summary / 总结
The pursuit of high-performance data transfer often focuses on raw network bandwidth.
Intelligent Detection and Mitigation of Carpet-Bombing DDoS Attacks in SDN Using Retrieval-Augmented Generation and Large Language Models
Authors: Mohammed N. Swileh, Shengli Zhang, Kai Lei
First: 2026-05-25T19:58:45+00:00 · Latest: 2026-05-25T19:58:45+00:00
Abstract
Software-Defined Networking (SDN) provides flexible and programmable network management; however, its centralized control architecture remains highly vulnerable to Distributed Denial-of-Service (DDoS) attacks, particularly Carpet-Bombing DDoS attacks that distribute malicious traffic across multiple targets to evade conventional detection mechanisms. In this paper, a Retrieval-Augmented Generation (RAG)-based framework is proposed for real-time detection and mitigation of Carpet-Bombing DDoS attacks in SDN environments. The proposed framework combines interface-level traffic features representation, semantic embedding generation, FAISS-based similarity retrieval, and Large Language Model (LLM)-driven contextual inference to classify traffic behavior without requiring conventional supervised model training or retraining. To evaluate the effectiveness of the proposed framework, extensive experiments were conducted under multiple Carpet-Bombing DDoS attack scenarios with different attack intensities. In addition, two traffic representation strategies, namely structured JSON-based representation and natural language-based representation (NLR), were investigated using multiple state-of-the-art LLMs. The experimental results demonstrate that the proposed framework achieved highly accurate and stable attack detection performance, while the framework configuration utilizing the Gemma-4-31B-IT model achieved the strongest overall detection results. Furthermore, real-time experiments confirmed the capability of the proposed framework to rapidly detect and mitigate Carpet-Bombing DDoS attacks while maintaining stable SDN network operation. The obtained results highlight the effectiveness of integrating RAG mechanisms with LLM for intelligent and adaptive SDN security analysis.
Summary / 总结
Software-Defined Networking (SDN) provides flexible and programmable network management; however, its centralized control architecture remains highly vulnerable to Distributed Denial-of-Service (DDoS) attacks, particularly Carpet-Bombing DDoS attacks that distribute malicious traffic across multiple targets to evade conventional detection mechanisms.
Neural Router: Semantic Content Matching for Agentic AI
Authors: Lauri Lovén, Abhishek Kumar, Alexander Engelhardt, Alaa Saleh, Roberto Morabito, Xiaoli Liu, Naser Hossein Motlagh, Sasu Tarkoma
First: 2026-05-25T10:58:53+00:00 · Latest: 2026-05-25T10:58:53+00:00
Comments: 35 pages, 12 figures. Combined main paper and electronic supplement, folded into one document for arXiv
Abstract
Large language models (LLMs) can serve as the semantic-matching engine of a content-based publish/subscribe broker for agentic AI across the edge-cloud computing continuum, bridging the vocabulary and modality gaps that defeat keyword and embedding filters. Framed as offline multi-label retrieval over three public datasets spanning social-media, legal, and smart-home sensor domains (six LLMs, seven baselines), our central contribution is a two-crossover cost-accuracy characterisation: an analytical context-window crossover below which a CoverAndMerge compression pipeline reduces LLM invocations, and an empirical discrimination-capacity crossover above which matching accuracy collapses independently of context budget, by a model-dependent factor of parameter count and training generation. Two findings carry practical weight: above the discrimination crossover, compression cannot recover accuracy and only frontier-scale models clear large subscription sets; and there backend choice dominates configuration choice, so model selection, not pipeline tuning, is the primary operator lever. We accompany this with three composable algorithms and a per-cluster Quality-of-Experience framework for autonomic LLM-tier selection.
Summary / 总结
Large language models (LLMs) can serve as the semantic-matching engine of a content-based publish/subscribe broker for agentic AI across the edge-cloud computing continuum, bridging the vocabulary and modality gaps that defeat keyword and embedding filters.
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
Authors: Seungeun Oh, Jinhyuk Kim, Jihong Park, Seung-Woo Ko, Jinho Choi, Tony Q. S. Quek, Seong-Lyun Kim
First: 2025-05-17T02:10:34+00:00 · Latest: 2026-05-25T03:16:45+00:00
Comments: 17 pages, 13 figures, 5 tables; This article has been accepted for publication in IEEE Transactions on Communications. This is the author's accepted version; the final published version will be available via IEEE Xplore
Abstract
To support emerging language-based applications using dispersed and heterogeneous computing resources, the hybrid language model (HLM) offers a promising architecture, where an on-device small language model (SLM) generates draft tokens that are validated and corrected by a remote large language model (LLM). However, the original HLM suffers from substantial communication overhead, as the LLM requires the SLM to upload the full vocabulary distribution for each token. Moreover, both communication and computation resources are wasted when the LLM validates tokens that are highly likely to be accepted. To overcome these limitations, we propose communication-efficient and uncertainty-aware HLM (CU-HLM). In CU-HLM, the SLM transmits truncated vocabulary distributions only when its output uncertainty is high. We validate the feasibility of this opportunistic transmission by discovering a strong correlation between SLM's uncertainty and LLM's rejection probability. Furthermore, we theoretically derive optimal uncertainty thresholds and optimal vocabulary truncation strategies. Simulation results show that, compared to standard HLM, CU-HLM achieves up to 206$\times$ higher token throughput by skipping 74.8% transmissions with 97.4% vocabulary compression, while maintaining 97.4% accuracy.
Summary / 总结
To support emerging language-based applications using dispersed and heterogeneous computing resources, the hybrid language model (HLM) offers a promising architecture, where an on-device small language model (SLM) generates draft tokens that are validated and corrected by a remote large language model (LLM).
TIP: A Decentralized Intent-Based Protocol for Declarative IoT Interoperability and Sandboxed Schema Adaptation
Authors: Yeison David Mejia Mosquera
First: 2026-05-25T01:28:12+00:00 · Latest: 2026-05-25T01:28:12+00:00
Comments: 12 pages, 3 figures
Abstract
Heterogeneous Internet of Things (IoT) systems suffer from fragmentation across hardware architectures, networking stacks, and data serialization formats. Existing standards (such as MQTT, COAP, and DDS) rely on address-bound, imperative routing models that require hardcoded configurations and leave no flexibility for runtime schema translation. This paper presents TIP (The Intent Protocol), a decentralized, declarative network protocol. Instead of addressing specific physical endpoints, nodes submit abstract intents specifying desired capabilities, schemas, and Quality of Service (QoS) constraints. The TIP Engine resolves matching nodes using a hybrid discovery mechanism combining local multicast DNS (mDNS) with Kademlia Distributed Hash Tables (DHT). Selection is optimized via a multi-criteria scoring algorithm incorporating network latency, historical reputation, and contract compliance. Mismatched data representations are reconciled on-the-fly inside isolated WebAssembly (WASM) sandboxes compiled dynamically from TOML specifications. Security is enforced through Ed25519 signatures, X25519 key exchanges, and ChaCha20-Poly 1305 payload encryption. Evaluation of our reference implementation in Rust and C++ shows sub-millisecond translation overhead and robust resilience under industrial conditions.
Summary / 总结
Heterogeneous Internet of Things (IoT) systems suffer from fragmentation across hardware architectures, networking stacks, and data serialization formats.
Device Context Protocol: A Compact, Safety-First Architecture for LLM-Driven Control of Constrained Devices
Authors: Dongxu Yang
First: 2026-05-24T12:37:19+00:00 · Latest: 2026-05-24T12:37:19+00:00
Comments: 15 pages, 5 figures. Reference implementation, Python package (pip install pydcp), and reproduction scripts at https://github.com/device-context-protocol/dcp
Abstract
Large language models are increasingly used as orchestrators of external tools via the Model Context Protocol (MCP), but MCP is built for software services with megabytes of memory and does not descend to the microcontrollers that dominate the long tail of physical devices. Recent work (IoT-MCP) ports MCP to edge gateways at 74 KB peak memory; this still excludes the smallest commodity MCUs and, critically, does not address the safety problem of giving an unreliable caller (an LLM that may hallucinate or be prompt-injected) direct control of physical hardware. We present the Device Context Protocol (DCP): a sub-50-byte typical frame (6-byte header + CBOR payload + optional 16-byte HMAC), a manifest schema in which capability scoping, range and type checks, dry-run evaluation, and units-as-types are protocol-layer primitives, and a host-side Bridge that rejects malformed or hallucinated calls before any byte reaches the device. Reference firmware measures 27.6 KB flash / 0.6 KB RAM on ESP32; the Python Bridge, ESP32 firmware, and a language-neutral conformance suite are MIT-licensed and public. An empirical study -- 675 tool calls produced by five LLMs across four vendors (DeepSeek, Alibaba, Zhipu, MiniMax) against six categories of adversarial prompts, with the injection category instantiating AgentDojo's attack templates -- shows DCP rejects 100% of capability-escalation attempts and 78% of prompt-injection attempts, versus 0--1% for Raw MCP and IoT-MCP, matching the expressiveness of a well-formed OpenAPI 3 schema at three orders of magnitude less firmware footprint. We position DCP as the missing layer between MCP (which is moving toward enterprise SaaS connectivity) and the physical devices it does not reach.
Summary / 总结
Large language models are increasingly used as orchestrators of external tools via the Model Context Protocol (MCP), but MCP is built for software services with megabytes of memory and does not descend to the microcontrollers that dominate the long tail of physical devices.
Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning
Authors: Xuanting Xie, Zhaochen Guo, Bingheng Li, Xingtong Yu, Zhifei Liao, Zhao Kang, Yuan Fang
Venue: ICML 2026
First: 2026-05-24T04:58:44+00:00 · Latest: 2026-05-24T04:58:44+00:00
Comments: Accepted by ICML 2026
Abstract
Chain-of-Thought (CoT) prompting has shown promise in enhancing the reasoning capabilities of large language models (LLMs) on text-attributed graphs (TAGs). This work reframes CoT-based graph learning through the principle of clustering as reasoning, offering a $k$-means interpretation of how iterative reasoning operates over graph-structured data. We observe that existing graph CoT methods rely on disjoint architectures and fixed graph representations, limiting step-by-step semantic-topological interaction and interpretability. To overcome this limitation, we propose a unified framework named KCoT that integrates CoT reasoning with graph representation learning. Our key theoretical result reveals a formal mathematical correspondence between a Transformer block and the $k$-means algorithm, allowing reasoning to be interpreted as iterative assignment and update steps. Based on this insight, we introduce a Semantic Discriminating Prompt that explicitly formulates these steps as structured CoT reasoning, together with a structure-grounded alignment strategy to fuse topological priors with evolving thought-conditioned representations. Experiments on standard benchmarks demonstrate consistent improvements over state-of-the-art methods, validating clustering as a principled mechanism for CoT-based graph learning.
Summary / 总结
Chain-of-Thought (CoT) prompting has shown promise in enhancing the reasoning capabilities of large language models (LLMs) on text-attributed graphs (TAGs).
ReclaimNet: Reclaim-Aware Network Protocols for Voluntary GPU Sharing on Campus
Authors: Wenyang Jia, Jingjing Wang, Xianneng Zou, Kai Lei
First: 2026-05-23T22:00:35+00:00 · Latest: 2026-05-23T22:00:35+00:00
Abstract
University campuses host abundant but fragmented GPU resources whose voluntary sharing is blocked by a mismatch between revocable, autonomous ownership and migration mechanisms that assume stationary failure hazards, homogeneous interconnects, and unbounded transfer windows. We present ReclaimNet, a network-layer migration protocol suite that treats provider reclaim as a first-class contract rather than a failure case, combining three mechanisms: (i) reclaim-aware checkpoint scheduling that jointly adapts to time-varying departure hazards and contended bandwidth across co-resident jobs; (ii) volatility-aware destination selection integrating topology, survival probability, and notice-window feasibility; and (iii) deadline-aware migration traffic control with edge enforcement and a submillisecond TC BPF kill-switch. A two-month deployment on a 54-node heterogeneous campus testbed reduces work loss by 66% over Slurm preempt-and-requeue and 38% over pipeline-redundancy checkpointing, with 38% shorter downtime and under 3% degradation of background research traffic. The prototype is open-sourced at the anonymous repository https://anonymous.4open.science/r/ICNP2026-ReclaimNet/.
Summary / 总结
University campuses host abundant but fragmented GPU resources whose voluntary sharing is blocked by a mismatch between revocable, autonomous ownership and migration mechanisms that assume stationary failure hazards, homogeneous interconnects, and unbounded transfer windows.
ScaleAcross Explorer: Exploring Communication Optimization for Scale-Across AI Model Training
Authors: Minghao Li, Alicia Golden, Samuel Hsia, Michael Kuchnik, Adi Gangidi, Xu Zhang, Ashmitha Jeevaraj Shetty, Zachary DeVito, Weiwei Chu, Dong He, Haoci Zhang, Yuchen Hao, Ruoming Pang, James Hongyi Zeng, Ying Zhang, Minlan Yu, Carole-Jean Wu
First: 2026-05-23T01:11:19+00:00 · Latest: 2026-05-23T01:11:19+00:00
Comments: 28 pages, 27 figures
Abstract
The rapid scaling of large language model training requires distributing GPU resources across multiple data center buildings and regions. We refer to such paradigm as "scale-across" training. As infrastructure expands, the system design space becomes increasingly intricate, encompassing new model architectures, hardware heterogeneity, and evolving communication patterns. Drawing from Meta's production experience, we highlight the complexities of deploying training jobs across a few data centers housing hundreds of thousands of GPUs. To accelerate exploration of the large design space and to enable efficient training for frontier model development, we conduct in-depth characterization of three key design dimensions: parallelism placement, parallelism scheduling, and network layer technologies. We then propose ScaleAcross Explorer, an optimizer that considers the interplay of design dimensions and holistically optimizes scale-across training. Testbed experiments and simulations demonstrate up to 64.62% training speedups over production configuration and up to 37.59% training speedups over the state-of-the-art baseline across a wide range of design points.
Summary / 总结
The rapid scaling of large language model training requires distributing GPU resources across multiple data center buildings and regions.
Network Digital Twin for Congestion-Aware Predictive Traffic Routing using Graph MPNNs
Authors: Umer Iqbal, Ashiq Anjum, Anthony S Conway, Mathias Kern, Anasol Pena Rios
First: 2026-05-23T00:53:07+00:00 · Latest: 2026-05-23T00:53:07+00:00
Abstract
Telecom networks scale with growing users and data-intensive applications, generating heavy traffic that causes congestion, reducing throughput, increasing delay, and raising computational costs. Traditional routing protocols act only after performance degradation, making them unsuitable for dynamic traffic and topological changes. Addressing these challenges requires a routing approach that adapts in real time, scales with network growth, operates without disrupting active services, and provides continuous feedback for congestion-aware traffic optimisation. The Network Digital Twin (NDT) addresses these needs by mirroring global network behaviour using Message Passing Neural Networks (MPNNs) through bidirectional communication with the physical network. To align the NDT with physical network behaviour, synthetic traffic is generated with increasing load across topological structures that incrementally scale as routers are added. These topologies are created by graph-generating models such as Erdos-Renyi, Barabasi-Albert, and Watts-Strogatz, customised with vertex degree limitations. The NDT collects performance metrics from routers and links, and MPNNs classify edges based on local vertex and global network behaviours. Based on these classifications, feedback is sent as Policy-Based Routing (PBR) protocol commands to each router, enabling optimal traffic distribution across links of the physical network.
Summary / 总结
Telecom networks scale with growing users and data-intensive applications, generating heavy traffic that causes congestion, reducing throughput, increasing delay, and raising computational costs.
Adversarial Network Imagination: Causal LLMs and Digital Twins for Proactive Telecom Mitigation
Authors: Vignesh Sriram, Yuqiao Meng, Luoxi Tang, Zhaohan Xi
First: 2026-01-09T15:15:05+00:00 · Latest: 2026-05-22T20:00:54+00:00
Abstract
Telecommunication networks experience complex failures such as fiber cuts, traffic overloads, and cascading outages. Existing monitoring and digital twin systems are largely reactive, detecting failures only after service degradation occurs. We propose Adversarial Network Imagination, a closed-loop framework that integrates a Causal Large Language Model (LLM), a Knowledge Graph, and a Digital Twin to proactively generate, simulate, and evaluate adversarial network failures. The Causal LLM produces structured failure scenarios grounded in network dependencies encoded in the Knowledge Graph. These scenarios are executed within a Digital Twin to measure performance degradation and evaluate mitigation strategies. By iteratively refining scenarios based on simulation feedback, the framework shifts network operations from reactive troubleshooting toward anticipatory resilience analysis.
Summary / 总结
Telecommunication networks experience complex failures such as fiber cuts, traffic overloads, and cascading outages.
BShare: Packet Queueing Delay-Driven Buffer Sharing for Datacenter Switches
Authors: Krishna Agarwal, Muhamad Rizka Maulana, Vamsi Addanki, Habib Mostafaei
First: 2026-05-22T19:57:33+00:00 · Latest: 2026-05-22T19:57:33+00:00
Abstract
Modern datacenter switches share packet buffers across ports to boost overall throughput and reduce packet loss. However, as buffer availability per-port-per-bandwidth unit continues to decrease, existing buffer-sharing strategies face increasing performance challenges. Recent efforts have attempted to integrate Buffer Management (BM) with Active Queue Management (AQM) to harness the advantages of both BM and AQM approaches to improve performance. While these hybrid solutions show promise, their complexity of dynamically calculating multiple factors for integration hinders generalization and efficiency. This paper presents BShare, a simple buffer sharing mechanism that uses packet queueing delay. BShare requires only a single operator-configurable parameter. Our simulation results show that BSHARE improves the flow completion time (FCT) performance of advanced transport protocols, such as PowerTCP, by up to 45.07% compared to ABM, particularly under burst-heavy datacenter workloads.
Summary / 总结
Modern datacenter switches share packet buffers across ports to boost overall throughput and reduce packet loss.
EnCoR: An end-to-end architecture for simplifying cellular networks
Authors: Wesley Woo, Zhuowei Wen, Monniiesh Velmurugan, Richard Raad, Sylvia Ratnasamy, Scott Shenker, Shaddi Hasan
First: 2026-05-21T14:16:33+00:00 · Latest: 2026-05-22T19:50:13+00:00
Abstract
Since their creation, cellular networks have made in-network mobility support a key feature of their service model. While this approach provides seamless connectivity for legacy traffic, it has the side effects of inflating end-user latency and increasing complexity and operational overhead for operators. Yet modern applications and transport protocols are increasingly mobility tolerant, prompting us to revisit the assumption that mobility must be provided as an in-network service. In this paper, we propose EnCoR (End-to-End Core and RAN), a deployable cellular network architecture that removes mobility from the core entirely. Leveraging end-to-end mobility, EnCoR eliminates tunnel-based IP anchoring while preserving compatibility with existing authentication, charging, and QoS techniques. We demonstrate that EnCoR works with unmodified phones while providing equivalent performance as traditional LTE networks for real applications including video and voice calling and video streaming. We show that EnCoR not only allows network operators to reduce end to end latency, but can also reduce the capital cost of providing low latency service to users by more than 90% compared to 3GPP networks, based on cost estimates for cellular network core and border router infrastructure provided by the FCC. Finally, we demonstrate that these gains are achieved while reducing the amount of overall handover control messaging, allowing the EnCoR core network to handle a greater number of mobility handover events than an LTE core under identical hardware constraints, achieving a 2.6x lower handover latency under load.
Summary / 总结
Since their creation, cellular networks have made in-network mobility support a key feature of their service model.
XWind: A Cross-site Router for Large Language Model Inference Serving at Renewable Energy Farms
Authors: Tella Rajashekhar Reddy, Atharva Deshmukh, Liangcheng Yu, Chaojie Zhang, Mike Shepperd, Rohan Gandhi, Anjaly Parayil, Srinivasan Iyengar, Ajay Manchepalli, Debopam Bhattacherjee
First: 2026-05-22T08:08:47+00:00 · Latest: 2026-05-22T08:08:47+00:00
Abstract
AI power demand is growing at an unprecedented rate while power grids are often ailing and struggle to keep up. Grid expansion comes with high capital expenditure and long-distance transmission losses, yet there is abundant renewable energy at the source, just not matched to demand.
This paper proposes a complementary AI infrastructure deployment model, AI Greenferencing, that brings modular AI compute to renewable energy sources, focusing on wind, allowing AI footprint expansion, generating local behind-the-meter demand for renewable sites, and helping ease the growing strain on power utilities. Our feasibility analysis shows that 890+ GW of wind capacity lies within 50 ms network round trip time of Azure data centers, and that site-wise right-sizing combined with spatial complementarity of wind energy keeps aggregate fleet utilization on par with traditional deployments.
To serve inference requests under variable wind power, we build XWind, a lightweight, reactive, and workload-agnostic AI inference router that uses only real-time signals: inference latency, KV-cache utilization, and queue depth, to dynamically configure sites and distribute requests. Evaluated on a real 64-GPU A100 testbed emulating three wind-powered sites with Azure production traces, XWind reduces P99 end-to-end latency by up to 52% over the strongest contender (also our idea) and by up to 98% over baselines such as power-capping and GPU idling, with consistent gains across workload types, load levels, and GPU generations.
Summary / 总结
AI power demand is growing at an unprecedented rate while power grids are often ailing and struggle to keep up.
Purification Strategy Optimization for Entanglement Routing in Quantum Networks
Authors: Javier Vecino Peñas, Ana Fernández-Vilas, Rebeca P. Díaz-Redondo, Sergio Gándara Gándara, Manuel Fernández-Veiga
First: 2026-05-22T07:45:50+00:00 · Latest: 2026-05-22T07:45:50+00:00
Comments: Accepted in IEEE qCCL'26
Abstract
Quantum networks rely on the efficient distribution of entanglement to enable long-distance quantum communication and information processing. A key challenge in these networks is the design of routing protocols capable of maintaining high quality entanglement in the presence of noise, decoherence, and imperfect operations, which progressively degrade the fidelity of entangled states through entanglement swapping. Entanglement purification provides an effective mechanism to mitigate this degradation at the cost of additional resources. In this work, we study purification-aware quantum routing and formulate the problem of selecting optimal purification strategies as an optimization task. By employing dynamic programming techniques, we identify strategies that optimally balance resource consumption and end-to-end fidelity, demonstrating the effectiveness of our approach across different scenarios.
Summary / 总结
Quantum networks rely on the efficient distribution of entanglement to enable long-distance quantum communication and information processing.
On the Performance of DCF in Full Duplex WLANs with Hidden Terminals
Authors: Anastasios C. Politis, Constantinos S. Hilas, Hristos T. Anastassiu
First: 2026-05-22T06:29:59+00:00 · Latest: 2026-05-22T06:29:59+00:00
Comments: 7 pages, 7 figures, 2022 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom)
Abstract
Full Duplex (FD) technology is considered as one of the next big leap in the evolution of modern WLANs. Allowing a node to simultaneously transmit a data frame while in receive mode, can theoretically double the system throughput. However, several requirements must be fulfilled in order for FD operation to manifest. One obvious prerequisite is that the Medium Access Control (MAC) mechanism must allow two nodes to access the shared medium simultaneously. In modern WLANs the standard MAC layer mechanism is the Distributed Coordination Function (DCF), which is specifically designed to avoid such situations. FD communications may also take place when the physical placement of the communicating parts involves the existence of hidden terminals which, in standard Half Duplex (HD) communications, imposes a significant problem. This paper investigates the performance of the Carrier Sense Multiple Access with Collision Avoidance (CSMA/CA) protocol, which constitutes the basis of the DCF mechanism, in FD WLANs with hidden terminals, and compares it with the standard HD case. Our analysis is based on performance modelling. Results indicate that, under the DCF regime, FD technology exhibits an exiguous performance improvement, in terms of saturation throughput, when compared with its half duplex counterpart.
Summary / 总结
Full Duplex (FD) technology is considered as one of the next big leap in the evolution of modern WLANs.
Relay-Based Synchronization of Replicated Data Types in Opportunistic Networks
Authors: Frédéric Guidec, Yves Mahéo
First: 2026-05-21T13:44:50+00:00 · Latest: 2026-05-21T13:44:50+00:00
Comments: 33 pages
Abstract
In Opportunistic Networks (OppNets), the dissemination of information can only rely on transient pairwise radio contacts between mobile devices (peers). Designing distributed applications that can run in such conditions is a challenge, but replicated data types, and in particular Conflict-free Replicated Data Types (CRDTs), can help meet this challenge. A CRDT is inherently replicated data type whose replicas can be updated locally, yet eventually converge thanks to an anti-entropy algorithm that allows all replicas to synchronize in the background. Whether the replicas of a CRDT can actually converge in an OppNet, and how fast they can converge, depend on the occurrence of radio contacts between mobile devices. In this paper we investigate the idea of using mobile relays as a means to boost the convergence of stated-based CRDT replicas in an OppNet. New protocols are presented that allow the synchronization of replicas and relays, and new metrics are defined to observe and characterize the convergence of replicas. Simulation results show that using relays can significantly improve this convergence, and even make it possible in scenarios where the replicas alone would be unable to converge.
Summary / 总结
In Opportunistic Networks (OppNets), the dissemination of information can only rely on transient pairwise radio contacts between mobile devices (peers).
SpaceMoE: Realizing Distributed Mixture-of-Experts Inference over Space Networks
Authors: Zhanwei Wang, Huiling Yang, Min Sheng, Khaled B. Letaief, Kaibin Huang
First: 2026-05-01T08:40:31+00:00 · Latest: 2026-05-21T12:32:36+00:00
Abstract
Leveraging continuous solar energy harvesting at high efficiency, space data centers are envisioned as a promising platform for executing energy-intensive large language models (LLMs). Recognizing this advantage, space and AI conglomerates (e.g., SpaceX, Google) are actively investing in this vision. One key challenge, however, is the efficient distributed deployment of a large-scale LLM in a satellite network due to the limited onboard computing and communication resources. This gives rise to a placement problem that involves partitioning and mapping model components to satellites such that the fundamentally different model architecture and network topology can be reconciled to ensure low-latency token generation. To address this problem, we present the Space Network of Mixture-of-Experts (SpaceMoE) framework targeting the distributed execution of a popular mixture-of-experts (MoE) model in space. The proposed placement strategies are two-level: (1) layer placement, which assigns MoE layers to satellite subnets; and (2) intra-layer expert placement, which assigns individual experts to satellites associated with the same layer/subnet. For layer placement, we exploit the ring-like communication pattern of autoregressive inference to partition the satellite constellation along the orbiting direction into subnets arranged on a ring, each hosting one MoE layer. Based on this architecture, we formulate and solve an optimization problem for intra-layer expert placement to map experts with heterogeneous activation probabilities onto satellites. The derived strategy reveals an intuitive principle: a frequently activated expert should be mapped to a satellite on a routing path with low expected latency. Experiments over a thousand-satellite constellation show that SpaceMoE achieves at least a threefold latency reduction compared with conventional random and ablation-based placement strategies.
Summary / 总结
Leveraging continuous solar energy harvesting at high efficiency, space data centers are envisioned as a promising platform for executing energy-intensive large language models (LLMs).
Eliminating Premature Termination in Multihop Rendezvous for Cognitive Radio-based Emergency Response Network
Authors: Zahid Ali, Saritha Unnikrishnan, Eoghan Furey, Ian McLoughlin, Saim Ghafoor
First: 2026-05-21T11:14:11+00:00 · Latest: 2026-05-21T11:14:11+00:00
Comments: Submitted to Results in Engineering, Elsevier
Abstract
In post-disaster environments, damaged communication infrastructure severely limits coordination among emergency response teams. Cognitive radio networks (CRNs) enable rapidly deployable communication by allowing nodes to opportunistically access available spectrum. However, existing multihop rendezvous protocols typically rely on N-1 termination conditions, which can lead to premature termination, resulting in incomplete neighbour discovery and invalid network topology formation. This work identifies this limitation as a previously overlooked issue in multihop rendezvous protocols. This paper proposes a Multihop Reliable Dual-Modular Clock Algorithm (MR-DMCA) that eliminates premature termination and ensures reliable network formation. The proposed protocol introduces a coordinate-assisted neighbour validation mechanism and an autonomous termination strategy that guarantees complete neighbour and topology discovery before protocol termination. Although implemented within MR-DMCA, the proposed validation and termination approach is applicable to a wider class of multihop rendezvous protocols. Extensive simulations demonstrate that, in a worst-case scalable scenario with 20 nodes and 20 channels under high primary radio activity (m=2), MR-DMCA achieves 100% accurate neighbour and topology discovery while reducing rendezvous time by up to 76%, 37%, and 17% compared with baseline protocols. The results highlight that addressing premature termination is critical for reliable multihop rendezvous in cognitive radiobased emergency communication networks.
Summary / 总结
In post-disaster environments, damaged communication infrastructure severely limits coordination among emergency response teams.
Throughput and Delay Performance of Slotted Aloha in SmartBANs under Saturation Conditions
Authors: Anastasios C. Politis, Constantinos S. Hilas
Venue: IEEE Networking Letters, vol. 6, no. 3, pp. 168-172, September 2024
First: 2026-05-21T11:02:27+00:00 · Latest: 2026-05-21T11:02:27+00:00
Comments: 5 pages, 5 figures
Abstract
This letter evaluates the performance of the slotted Aloha protocol defined by the European Telecommunication Standard Institute (ETSI) SmartBAN specification, under saturation conditions. For this purpose, we develop a two-dimensional Discrete Time Markov Chain (DTMC) to model the operational details of the protocol and assess its performance in terms of saturation throughput and average end-to-end delay. The accuracy of the proposed model is validated by means of simulation which reveals a very good match among theoretical and simulation results. The model can be used for protocol performance prediction and optimization purposes.
Summary / 总结
This letter evaluates the performance of the slotted Aloha protocol defined by the European Telecommunication Standard Institute (ETSI) SmartBAN specification, under saturation conditions.
Toward Realistic Wi-Fi Fault Diagnosis: A Multi-Modal Benchmark
Authors: Junjian Zhang, Haobo Deng, Xinxin Li, Ming Zhao, Fengxiao Tang, Nei Kato
First: 2026-05-21T05:09:15+00:00 · Latest: 2026-05-21T05:09:15+00:00
Abstract
Intelligent network operation and maintenance systems in modern networks continuously generate large volumes of multi-modal operational data. However, Wi-Fi fault diagnosis under heterogeneous operational environments remains insufficiently understood. We build a real-world Wi-Fi testbed deployed in campus working environments with an automated fault injection system, and collect a multi-modal Wi-Fi fault dataset containing over 10,000 fault samples across diverse wireless scenarios. To the best of our knowledge, this is among the first publicly available datasets jointly capturing heterogeneous cross-layer operational observations for Wi-Fi fault diagnosis. Based on this dataset, we establish a unified benchmark spanning multiple diagnosis tasks, operational modalities, and representative diagnosis paradigms. Experimental results indicate that effectively leveraging heterogeneous operational data remains challenging for existing diagnosis approaches. We further evaluate emerging LLM-based approaches and develop a reasoningoriented evaluation framework to assess the consistency between generated diagnostic analyses and actual network conditions. Our findings suggest several important considerations for future multi-modal Wi-Fi diagnosis.
Summary / 总结
Intelligent network operation and maintenance systems in modern networks continuously generate large volumes of multi-modal operational data.
RNG: Flat Datacenter Networks at Scale
Authors: Giacomo Bernardi, Ratul Mahajan, C. Seshadhri, Enrico Carlesso, Chinchu Merine Joseph, Saurabh Kumar, Pavan Manikonda, Luiza Popa, Randy Ram, Steven Robinson, Elizabeth Tennent
First: 2026-04-16T17:37:04+00:00 · Latest: 2026-05-21T04:26:09+00:00
Abstract
We design and deploy in production the first flat datacenter networks. Our design, called RNG, is based on quasi-random graphs. While the cost and fault-tolerance benefits of such topologies have been long known, their practical realization has been hampered by a lack of scalable routing and cabling approaches. RNG has a new distributed routing protocol that exploits the properties of random graphs to find a large number of edge disjoint paths between pairs of endpoints. It uses a novel passive optical device that internally shuffles cables, which makes its cabling complexity similar to that of fat trees. We show that RNG matches or exceeds the performance of fat trees for a range of traffic patterns, despite being up to 45% cheaper. RNG is now the default datacenter network for most workloads at Amazon.
Summary / 总结
We design and deploy in production the first flat datacenter networks.
An Open-Source Framework to Emulate Delay and Disruption Tolerant Networks for International Space Station Communication
Authors: Krit Grover, Marcelo Ponce
First: 2026-05-20T18:36:40+00:00 · Latest: 2026-05-20T18:36:40+00:00
Comments: To be presented at the "29th International Symposium on Real-Time Distributed Computing" ISORC 2026
Abstract
Delay and Disruption Tolerant Networks (DTN) are critical for reliable communications in challenged network environments, particularly for space systems where end-to-end connectivity cannot be guaranteed. We present an open-source, full-stack implementation of the Bundle protocol for communicating with the International Space Station (ISS), with complete security features including Bundle Authentication Block (BAB), Payload Integrity Block (PIB), and Payload Confidentiality Block (PCB) using HMAC-SHA256 and AES-256-CBC encryption. The system includes bundle fragmentation and reassembly, priority-based queuing, custody transfer with ACK/NAK mechanisms, and automatic retransmission. Our system also includes a frontend facilitated by a modern responsive web interface. We consider this work highly relevant in the context of computer networking because: i) it demonstrates a full stack, open-source, freely available implementation of this critical and reliable protocol; and ii) it offers an interactive educational and learning framework in the field of computer networks and communications.
Summary / 总结
Delay and Disruption Tolerant Networks (DTN) are critical for reliable communications in challenged network environments, particularly for space systems where end-to-end connectivity cannot be guaranteed.
Enhanced-BLE: A Hybrid BLE-ESB Framework for Dynamically Reconfigurable and Energy-Efficient 2.4 GHz IoT Communication
Authors: Ziyao Zhou, Chen Shen, Tiancheng Cao, Hen-Wei Huang
First: 2026-05-20T14:58:41+00:00 · Latest: 2026-05-20T14:58:41+00:00
Abstract
Bluetooth Low Energy (BLE) is widely used in IoT systems because of its low power consumption, interoperability, and reliable bidirectional communication. However, its connection-oriented architecture introduces trade-offs among wake-up latency, throughput, and energy efficiency, limiting its suitability for burst-mode and on-demand sensing applications. Enhanced ShockBurst (ESB), a lightweight connectionless protocol supported by the same 2.4 GHz Nordic Semiconductor hardware, enables fast wake-up and efficient data transmission, but does not provide BLE-level robustness for sustained bidirectional communication. This work systematically benchmarks BLE and ESB on a unified Nordic nRF54L15 platform and proposes Enhanced-BLE, a hybrid framework that integrates the two protocols to extend conventional BLE operation. Experimental results show that ESB nearly halves packet transmission time and energy compared with BLE, doubles the achievable forward throughput, and reduces wake-up latency and energy by nearly twentyfold during intermittent operation. However, ESB reverse transmission may suffer packet loss, whereas BLE maintains reliable bidirectional communication. Enhanced-BLE addresses this trade-off through adaptive radio scheduling and coexistence-aware connection management, combining ESB-based high-throughput forward transmission with BLE-based reliable reverse communication. The framework enables BLE-to-ESB handover within approximately 18 ms and restores BLE operation within 49 ms from standby mode. Enhanced-BLE also achieves approximately twofold higher forward throughput than BLE while reducing wake-up latency. These results demonstrate a practical and hardware-compatible strategy for low-latency, high-throughput, energy-efficient, and reliable 2.4 GHz IoT communication.
Summary / 总结
Bluetooth Low Energy (BLE) is widely used in IoT systems because of its low power consumption, interoperability, and reliable bidirectional communication.
High-speed Networking for Giga-Scale AI Factories
Authors: Sajy Khashab, Albert Gran Alcoz, Alon Gal, Jacky Romano, Rani Abboud, Yonatan Piasetzky, Lior Maman, Amit Nishry, Barak Gafni, Omer Shabtai, Matty Kadosh, Dror Goldenberg, Gilad Shainer, Mark Silberstein
First: 2026-05-20T13:52:47+00:00 · Latest: 2026-05-20T13:52:47+00:00
Abstract
As distributed model training scales to span hundreds of thousands of GPUs, scale-out networks face unprecedented performance and efficiency demands. NVIDIA Spectrum-X Ethernet has been designed from the ground up to achieve predictable and stable network performance with high utilization and low latency. This paper presents the Spectrum-X multiplane architecture, which replaces hierarchical depth with topological parallelism, and introduces hardware-accelerated load balancing in NICs and switches as the key architectural approach to provide fast reaction to highly dynamic network conditions at the microsecond timescales that AI training workloads demand. We describe the motivation, design principles, evaluation methodology and performance on state-of-the-art benchmarks, as well as the lessons we learned from deploying and debugging Spectrum-X networks in large-scale systems. Our evaluation highlights production-grade AI infrastructure performance across three core dimensions: 98% of the theoretical line rate with low jitter-free latency; strong cross-tenant isolation for concurrent workloads; robust, capacity-proportional bisection bandwidth and 7% latency increase for 10% fabric link failures; and rapid reaction to host and fabric link flaps during LLM training workloads.
Summary / 总结
As distributed model training scales to span hundreds of thousands of GPUs, scale-out networks face unprecedented performance and efficiency demands.
TrimCaching: Parameter-sharing Edge Caching for AI Model Downloading
Authors: Guanqiao Qu, Zheng Lin, Qian Chen, Jian Li, Fangming Liu, Xianhao Chen, Kaibin Huang
First: 2024-04-22T14:13:36+00:00 · Latest: 2026-05-20T02:47:49+00:00
Comments: 19 pages, 13 figures. Part of this work has been accepted by ICDCS 2024
Abstract
Next-generation mobile networks are expected to facilitate fast AI model downloading to end users. By caching models on edge servers, mobile networks can deliver models to end users with low latency, resulting in a paradigm of edge model caching. In this paper, we develop a novel model placement framework, called parameter-sharing model caching (TrimCaching). TrimCaching exploits the key observation that a wide range of AI models, such as convolutional neural networks or large language models, can share a significant proportion of parameter blocks containing reusable knowledge, thereby improving storage efficiency. To this end, we formulate a parameter-sharing model placement problem to maximize the cache hit ratio in multi-edge wireless networks by balancing the fundamental tradeoff between storage efficiency and service latency. We show that the formulated problem is a submodular maximization problem with submodular constraints, for which no polynomial-time approximation algorithm exists. To tackle this challenge, we study an important special case, where a small fixed number of parameter blocks are shared across models, which often holds in practice. In such a case, a polynomial-time algorithm with a $\left(1-ε\right)/2$-approximation guarantee is developed. Subsequently, we address the original problem for the general case by developing a greedy algorithm. Simulation results demonstrate that the proposed TrimCaching framework significantly improves the cache hit ratio compared with state-of-the-art content caching without exploiting shared parameters in AI models.
Summary / 总结
Next-generation mobile networks are expected to facilitate fast AI model downloading to end users.
Intent-First Aerial V2V for Tactical Coordination and Separation: Protocol and Performance Under Density and Disturbance
Authors: Mehrnaz Sabet
First: 2026-05-20T01:04:17+00:00 · Latest: 2026-05-20T01:04:17+00:00
Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems
Abstract
Dense low-altitude aerial operations require more than pre-flight route coordination and last-resort collision avoidance. Once aircraft are airborne, disturbances can emerge on timescales shorter than strategic reauthorization can absorb, while collision avoidance is too late and disruptive to serve as routine traffic management. Although tactical separation is recognized as the intermediate layer, realizing it at scale requires a deployable neighborhood communication mechanism that provides fresh, trusted information for local coordination. This paper presents what is, to our knowledge, the first controller-coupled characterization of an all-airborne, sidelink-class, intent-first vehicle-to-vehicle (V2V) tactical neighborhood exchange stack for dense Unmanned Aircraft System Traffic Management (UTM) operations. Unlike awareness-only broadcast, the proposed exchange combines refreshed state and intent beacons for local awareness, cooperative perception, and degraded-mode assessment with event-triggered messages for yielding, sequencing, release, and contingency coordination. We implement and evaluate this model on an all-airborne V2V stack using sidelink-class C-V2X modules with authenticated freshness checks. Evaluation uses a scenario-driven, high-volume stress campaign supported by real-time, field-anchored infrastructure. Results show that V2V reduces stale-belief divergence, preserves observability through cooperative perception, rejects invalid tactical messages, suppresses false local inference, and structures shared-resource coordination. The implemented stack provides a viable communication layer for tactical separation in lower-to-moderate regimes, but transitions toward guarded fallback as density, impairment, and complexity increase. These findings position intent-first aerial V2V as a bounded enabler for scaling tactical coordination in disturbance-driven urban airspace.
Summary / 总结
Dense low-altitude aerial operations require more than pre-flight route coordination and last-resort collision avoidance.
Detecting Data Exfiltration through I2P Anonymity Networks: A Two-Phase Machine Learning Approach
Authors: Siddique Abubakr Muntaka, Muntaka Mohammed, Mansuru Mikail Azindo, Ibrahim Tanko, Franco Osei-Wusu, Edward Danso Ansong, Benjamin Yankson, Oliver Kornyo, Foster Yeboah, Jones Yeboah, Richmond Adams, Pulcheria Serwaa
First: 2026-05-19T22:46:22+00:00 · Latest: 2026-05-19T22:46:22+00:00
Abstract
The Invisible Internet Project (I2P) provides strong anonymity through garlic routing and distributed network architecture, making it attractive for legitimate privacy needs. Nevertheless, the same properties can be exploited by malicious actors to steal sensitive information from corporate networks without detection. Current network security measures often fail to detect I2P traffic, and existing literature has focused primarily on protocol-level traffic identification without addressing behavioral threat assessment. This paper proposes a two-stage machine-learning model for I2P traffic analysis using the SafeSurf Darknet 2025 dataset comprising 184,548 network flows. Phase 1 achieved 99.96% accuracy in distinguishing I2P traffic from normal network traffic using a Random Forest classifier, with only 2 false positives among 32,318 normal flows. Phase 2 performed behavioral analysis on traffic identified as I2P, classifying it as either exfiltration or legitimate activity, achieving 91.11% accuracy using XGBoost. The system demonstrates that tree-based ensemble methods substantially outperform deep neural networks and support vector machines for this task. Feature importance analysis indicates that the most discriminative features are packet timing and flow duration. These findings establish that accurate I2P traffic detection and threat prioritization are achievable in operational network environments, enabling security teams to focus resources on high-risk events rather than monitoring all encrypted traffic.
Summary / 总结
The Invisible Internet Project (I2P) provides strong anonymity through garlic routing and distributed network architecture, making it attractive for legitimate privacy needs.
A Meshtastic-based LoRa Mesh System for Smart Campus Applications: From Solar-Powered Sensing to Containerized Data Management
Authors: Rafael Garzon Andosilla, José de Jesús Rugeles Uribe
Venue: www
First: 2026-05-19T18:30:48+00:00 · Latest: 2026-05-19T18:30:48+00:00
Comments: 14 pages, 4 figures, 5 tables.To appear in the proceedings of the 6th CATAÏ Workshop (Bogotá, Colombia, May 7-8, 2026). Workshop website: https://www.catai.fr/catai2026.html
Abstract
This work presents the design, implementation, and evaluation of a LoRa-based mesh network using the Meshtastic protocol for Smart Campus applications at Universidad Militar Nueva Granada (UMNG). The system integrates heterogeneous hardware nodes including a solar-powered ecological sensing node built around a Raspberry Pi Pico and a Semtech SX1262 transceiver, and mobile trackers based on the Seeed SenseCAP T1000-E managed through a containerized edge gateway running on a Raspberry Pi 4. A Docker Compose microservices stack handles data ingestion via Node-RED, time-series storage in InfluxDB, and real-time visualization through Grafana dashboards. The architecture's performance was evaluated under realistic propagation scenarios at the UMNG Cajicá campus, characterizing link quality using Received Signal Strength Indicator (RSSI) and Signal-to-Noise Ratio (SNR) metrics. Experimental results demonstrate robust mesh connectivity across key university facilities, including an extended-range link of approximately 2.47 km linking the campus gateway to a remote station at Mirador La Cumbre (n = 62 packets received, mean RSSI = -110 dBm, mean SNR = +2.75 dB). This architecture demonstrates that open-source mesh protocols combined with containerized microservices offer an autonomous, highly reproducible infrastructure for environmental monitoring and asset tracking, supporting the transition toward data-driven "Smart Campus" ecosystems without reliance on centralized commercial LoRaWAN operators.
Summary / 总结
This work presents the design, implementation, and evaluation of a LoRa-based mesh network using the Meshtastic protocol for Smart Campus applications at Universidad Militar Nueva Granada (UMNG).
Fair-Aurora: Comparing Fairness Strategies for Reinforcement Learning-Based Congestion Control in Multi-Flow Environments
Authors: Thomas Mbrice, Yuyu Liu
First: 2026-05-19T14:38:12+00:00 · Latest: 2026-05-19T14:38:12+00:00
Abstract
Reinforcement learning (RL) has emerged as a promising paradigm for Internet congestion control, achieving higher link utilization than classical heuristics. However, RL-based controllers trained in single-flow environments are not guaranteed to share bandwidth equitably when deployed in multi-flow networks. This paper investigates the fairness properties of Aurora~\cite{jay2019aurora}, a state-of-the-art deep RL congestion controller, and evaluates three post-hoc fairness strategies that preserve Aurora's RL architecture: \emph{reward shaping} (Strategy~A), \emph{observation augmentation} (Strategy~B), and \emph{loss-sensitivity tuning} (Strategy~C). Using a custom shared-bottleneck simulator and Jain's fairness index as the primary metric, we find that modest reward shaping achieves the best fairness while preserving aggregate throughput. All strategies maintain the total bandwidth budget with fairness being achieved through redistribution, not reduction. Beyond the 2-flow homogeneous setting, an extended evaluation across mixed Aurora--CUBIC competition and dynamic flow entry/exit scenarios shows that Strategy~C's loss-sensitivity emerges as the most TCP-friendly mechanism, while Strategy~B is the most stable through dynamic flow-set changes.
Summary / 总结
Reinforcement learning (RL) has emerged as a promising paradigm for Internet congestion control, achieving higher link utilization than classical heuristics.
Security Analysis of Bitcoin's V2 Transport Protocol: Exploiting Design Implications for Sustained Eclipse and Downgrade Attacks
Authors: Charmaine Ndolo, Florian Tschorsch
First: 2026-05-19T11:50:57+00:00 · Latest: 2026-05-19T11:50:57+00:00
Comments: 34 pages, 16 figures, 2 tables
Abstract
Bitcoin recently introduced a new protocol for the encryption of peer-to-peer (P2P) communication. The protocol, known as V2 P2P transport, represents a big step towards securing the overlay network against various previously-known attack vectors. Based on an analysis of V2 P2P transport, this work examines the current viability of said attacks and concludes that while they are now remediated, alternative attacks and paths to similar objectives exist. The identified shortcomings are conceptual (and not implementation bugs) and even applicable to other P2P networks. We show how a network-level attacker can identify application messages using the length of TCP payloads, can eclipse a target node by taking advantage of how encrypted communication channels work and can downgrade all of a node's connections to the unencrypted protocol by using the mechanisms designed for compatibility. We validate our contributions using a combination of network measurements, emulations and simulations. Finally, we propose a series of short-term and long-term countermeasures towards securing Bitcoin's P2P network. To the best of our knowledge, we are the first to study Bitcoin's security under V2 P2P transport.
Summary / 总结
Bitcoin recently introduced a new protocol for the encryption of peer-to-peer (P2P) communication.
How Helpful is LLM Assistance in Network Operations? A Case Study at a Large Demonstration Network
Authors: Ryo Nakamura, Koshi Eguchi
First: 2026-05-19T10:06:02+00:00 · Latest: 2026-05-19T10:06:02+00:00
Abstract
This paper reports on a real-world case study in which over 100 network engineers assessed how a Large Language Model (LLM) can assist in building and operating a network. The versatility of LLMs has accelerated their adoption across a wide range of domains, and assisting network operations is one such promising application. LLMs are probabilistic models, unlike deterministic protocols and configurations; therefore, clarifying their capabilities -- how and to what extent LLMs can help in network operations -- is a crucial step toward adopting LLMs. To offer practical insights into this issue, we conducted an extensive experiment on a large demonstration network built for a public exhibition, consisting of 21 racks with heterogeneous network devices. In the experiment, a total of 105 network engineers used an LLM-based chatbot while building and operating the network. The chatbot was equipped with three external functions: retrieval-augmented generation for domain-specific knowledge, CLI control of network devices running on the network, and access to a ticket system. The participants gave evaluations for the chatbot's responses on a best-effort basis. Analysis of the chat histories shows that 68.1% of the evaluations were positive, indicating a quantitative baseline of the LLM's helpfulness in network operations. Our results also demonstrate that understanding the capabilities of the chatbot is important for eliciting better responses. Moreover, we provide detailed use case analyses while sharing actual user--chatbot interactions.
Summary / 总结
This paper reports on a real-world case study in which over 100 network engineers assessed how a Large Language Model (LLM) can assist in building and operating a network.
Sample-Efficient Misconfiguration Classification for Network Resilience in Wireless Communications
Authors: Xin Hao, Chenhan Zhang, Massimo Piccardi, Vijaya Durga Chemalamarri, Qiwen Jiang, Wei Ni, Raymond Owen
First: 2026-05-19T03:29:28+00:00 · Latest: 2026-05-19T03:29:28+00:00
Abstract
As modern wireless communication networks grow increasingly complex, network outages driven by the inconsistency between dynamic topologies and protocol configurations have become a critical concern. To solve this issue, we mathematically formulate a protocol misconfiguration classification problem as a graph-based learning task and solve it with our proposed EtaGATv2 algorithm, an edge-type-aware graph attention network with dynamic attention. EtaGATv2 addresses two critical challenges: i) it captures non-uniform symptom propagation for protocol misconfiguration classification tasks, where certain network paths and nodes become critical for diagnosis, and ii) it extracts protocol-specific features from heterogeneous routing protocols with distinct message-passing behaviors by utilizing edge-type-aware transformations. Experiments across diverse and real-world topologies demonstrate that EtaGATv2 reaches state-of-the-art performance with 50% of the training samples, making it particularly suitable for networks with dynamic topologies and limited negative-labeled data.
Summary / 总结
As modern wireless communication networks grow increasingly complex, network outages driven by the inconsistency between dynamic topologies and protocol configurations have become a critical concern.
Enabling Agile Ambient IoT Networking via a Parameterized Hybrid Radio
Authors: Jiazhen Lei, Fengyuan Zhu, Tianze Cao, Yuxin Sha, Linling Zhong, Wenhui Li, Bingbing Wang, Zeming Yang, Jinyang Sun, Yibin Deng, Xiaohua Tian
First: 2026-05-18T12:31:30+00:00 · Latest: 2026-05-18T12:31:30+00:00
Comments: 14 pages, 23 figures
Abstract
The emergence of Ambient IoT signals a paradigm shift toward massive batteryless networking. However, the absence of an agile physical layer substrate remains a fundamental barrier to research and standardization. Current testbeds are hindered by decoupled radio paths, high static power, and cumbersome control methods, which stifle rapid protocol prototyping. In this paper, we present Janus, the first hybrid active-passive configurable radio architected for agile Ambient IoT networking. Janus introduces a parameterized architecture that unifies passive and active transmission into a single RF front end, abstracting complex physical layer behaviors into concise parameters. This design enables a system-level control plane for dynamic mode transitions and an energy management plane for fine-grained harvesting across multiple sources. We implement a compact PCB prototype and evaluate its performance across diverse protocol landscapes, including 3GPP A-IoT, IEEE 802.11 AMP, and Bluetooth SIG. Our experimental results demonstrate that Janus achieves communication performance on par with dedicated radios while significantly reducing configuration overhead. Ultimately, Janus serves as a versatile enabler for validating emerging protocols and accelerating the standardization of next-generation low-power networks.
Summary / 总结
The emergence of Ambient IoT signals a paradigm shift toward massive batteryless networking.
ZeroSiam: An Efficient Asymmetry for Test-Time Entropy Optimization without Collapse
Authors: Guohao Chen, Shuaicheng Niu, Deyu Chen, Jiahao Yang, Zitian Zhang, Mingkui Tan, Pengcheng Wu, Zhiqi Shen
First: 2025-09-27T08:37:47+00:00 · Latest: 2026-05-18T11:34:53+00:00
Abstract
Test-time entropy minimization helps adapt a model to novel environments and incentivize its reasoning capability, unleashing the model's potential during inference by allowing it to evolve and improve in real-time using its own predictions, achieving promising performance. However, pure entropy minimization can favor non-generalizable shortcuts, such as inflating the logit norm and driving all predictions to a dominant class to reduce entropy, risking collapsed solutions (e.g., constant one-hot outputs) that trivially minimize the objective without meaningful learning. In this paper, we reveal asymmetry as a key mechanism for collapse prevention and introduce ZeroSiam--an efficient asymmetric Siamese architecture tailored for test-time entropy minimization. ZeroSiam prevents collapse through asymmetric divergence alignment, efficiently achieved by a learnable predictor and a stop-gradient operator before the classifier. We provide empirical and theoretical evidence that ZeroSiam not only prevents collapse, but also regularizes biased learning signals, enhancing performance even when no collapse occurs. Despite its simplicity, extensive results show that ZeroSiam performs more stably over prior methods using negligible overhead, demonstrating efficacy on both vision adaptation and large language model reasoning tasks across challenging test scenarios and diverse models, including particularly collapse-prone tiny models.
Summary / 总结
Test-time entropy minimization helps adapt a model to novel environments and incentivize its reasoning capability, unleashing the model's potential during inference by allowing it to evolve and improve in real-time using its own predictions, achieving promising performance.