Skip to content

Commit 20eb882

Browse files
edumazetnasty007
authored andcommitted
softirq: reduce latencies
commit c10d73671ad30f54692f7f69f0e09e75d3a8926a upstream. In various network workloads, __do_softirq() latencies can be up to 20 ms if HZ=1000, and 200 ms if HZ=100. This is because we iterate 10 times in the softirq dispatcher, and some actions can consume a lot of cycles. This patch changes the fallback to ksoftirqd condition to : - A time limit of 2 ms. - need_resched() being set on current task When one of this condition is met, we wakeup ksoftirqd for further softirq processing if we still have pending softirqs. Using need_resched() as the only condition can trigger RCU stalls, as we can keep BH disabled for too long. I ran several benchmarks and got no significant difference in throughput, but a very significant reduction of latencies (one order of magnitude) : In following bench, 200 antagonist "netperf -t TCP_RR" are started in background, using all available cpus. Then we start one "netperf -t TCP_RR", bound to the cpu handling the NIC IRQ (hard+soft) Before patch : # netperf -H 7.7.7.84 -t TCP_RR -T2,2 -- -k RT_LATENCY,MIN_LATENCY,MAX_LATENCY,P50_LATENCY,P90_LATENCY,P99_LATENCY,MEAN_LATENCY,STDDEV_LATENCY MIGRATED TCP REQUEST/RESPONSE TEST from 0.0.0.0 (0.0.0.0) port 0 AF_INET to 7.7.7.84 () port 0 AF_INET : first burst 0 : cpu bind RT_LATENCY=550110.424 MIN_LATENCY=146858 MAX_LATENCY=997109 P50_LATENCY=305000 P90_LATENCY=550000 P99_LATENCY=710000 MEAN_LATENCY=376989.12 STDDEV_LATENCY=184046.92 After patch : # netperf -H 7.7.7.84 -t TCP_RR -T2,2 -- -k RT_LATENCY,MIN_LATENCY,MAX_LATENCY,P50_LATENCY,P90_LATENCY,P99_LATENCY,MEAN_LATENCY,STDDEV_LATENCY MIGRATED TCP REQUEST/RESPONSE TEST from 0.0.0.0 (0.0.0.0) port 0 AF_INET to 7.7.7.84 () port 0 AF_INET : first burst 0 : cpu bind RT_LATENCY=40545.492 MIN_LATENCY=9834 MAX_LATENCY=78366 P50_LATENCY=33583 P90_LATENCY=59000 P99_LATENCY=69000 MEAN_LATENCY=38364.67 STDDEV_LATENCY=12865.26 Signed-off-by: Eric Dumazet <[email protected]> Cc: David Miller <[email protected]> Cc: Tom Herbert <[email protected]> Cc: Ben Hutchings <[email protected]> Signed-off-by: David S. Miller <[email protected]> [xr: Backported to 3.4: Adjust context] Signed-off-by: Rui Xiang <[email protected]> Signed-off-by: Zefan Li <[email protected]>
1 parent 98c0aa2 commit 20eb882

File tree

1 file changed

+9
-8
lines changed

1 file changed

+9
-8
lines changed

kernel/softirq.c

Lines changed: 9 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -194,21 +194,21 @@ void local_bh_enable_ip(unsigned long ip)
194194
EXPORT_SYMBOL(local_bh_enable_ip);
195195

196196
/*
197-
* We restart softirq processing MAX_SOFTIRQ_RESTART times,
198-
* and we fall back to softirqd after that.
197+
* We restart softirq processing for at most 2 ms,
198+
* and if need_resched() is not set.
199199
*
200-
* This number has been established via experimentation.
200+
* These limits have been established via experimentation.
201201
* The two things to balance is latency against fairness -
202202
* we want to handle softirqs as soon as possible, but they
203203
* should not be able to lock up the box.
204204
*/
205-
#define MAX_SOFTIRQ_RESTART 10
205+
#define MAX_SOFTIRQ_TIME msecs_to_jiffies(2)
206206

207207
asmlinkage void __do_softirq(void)
208208
{
209209
struct softirq_action *h;
210210
__u32 pending;
211-
int max_restart = MAX_SOFTIRQ_RESTART;
211+
unsigned long end = jiffies + MAX_SOFTIRQ_TIME;
212212
int cpu;
213213

214214
pending = local_softirq_pending();
@@ -255,11 +255,12 @@ asmlinkage void __do_softirq(void)
255255
local_irq_disable();
256256

257257
pending = local_softirq_pending();
258-
if (pending && --max_restart)
259-
goto restart;
258+
if (pending) {
259+
if (time_before(jiffies, end) && !need_resched())
260+
goto restart;
260261

261-
if (pending)
262262
wakeup_softirqd();
263+
}
263264

264265
lockdep_softirq_exit();
265266

0 commit comments

Comments
 (0)