Implementation and Performance Optimization of a DPDK Packet Gateway on Manycore CPUs

Daisuke Sugisawa

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Since approximately 2005, major processor manufacturers have shifted their architectural focus from instruction-level parallelism (ILP) toward multicore and manycore parallelism to achieve higher performance.Rather than relying on deeper pipelines and speculative execution, performance gains have increasingly been realized through thread-level parallelism (TLP).Consequently, the responsibility for efficiently utilizing processor resources has transitioned from hardware mechanisms to software implementations. This technical note examines design strategies for achieving deterministic, high-throughput packet processing on manycore architectures using the Data Plane Development Kit (DPDK).It presents a simplified Packet Gateway (PGW) pipeline implementation, analyzing cache-coherence effects, NUMA-local memory allocation, and multicore scheduling patterns critical to maintaining per-packet processing budgets under nanosecond-level constraints.

Version published to 10.20944/preprints202510.1658.v2
Jan 19, 2026
Version published to 10.20944/preprints202510.1658.v1
Oct 21, 2025

Implementation and Evaluation of MemGuard in the Bao Hypervisor

This article has 2 authors:
1. Everaldo Gomes
2. Giovani Gracioli
This article has no evaluationsLatest version Jan 19, 2026
Proposal for expanding FPGA offloading targets for environment adaptive software

This article has 1 author:
1. Yoji Yamato
This article has no evaluationsLatest version Dec 18, 2025
Evaluating Why Processor Development Transitioned from Gigahertz Increases to Multicore

This article has 4 authors:
1. Oleh Savchak
2. Yaroslav Rovnianskyi
3. Tariq Eldakruri
4. Edip Senyurek
This article has no evaluationsLatest version Dec 21, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Implementation and Evaluation of MemGuard in the Bao Hypervisor

Proposal for expanding FPGA offloading targets for environment adaptive software

Evaluating Why Processor Development Transitioned from Gigahertz Increases to Multicore