Linux调度器性能分析

zoukankan html css js c++ java

Linux调度器性能分析
This article was firstly published from http://oliveryang.net. The content reuse need include the original link.

1. Scheduling perf profiling goals

For an OS scheduler implementation, there are 3 key features,

Time Sharing

Let multiple tasks share the CPU time fairly and efficiently.

Preemption

Important or latency sensitive tasks could be scheduled as quick as possible.

Load balance

Allow multiple tasks to share multiple CPU resources in system wide fairly and efficiently.

If system ran into any CPU scheduling perf problems, we would see one of above features might get broken. Our scheduling perf profiling goal is to understand how scheduler behaves from these 3 perspectives, under a certain workload or benchmark.

2. The major symptoms of scheduling perf issues

The symptoms of scheduling perf issues could be also classified by above 3 perspectives,

High or Low CPU utilization

Big task scheduling latency

Imbalance CPU utilization or scheduling latency

Please note that above symptoms might not always be caused by a kernel scheduler bug. For this reason, the most important thing is, we must define the performance problem with a clear baseline. With a clear baseline, we could have better efficiency to rule out different possibilities which have the similar symptoms.

3. The scheduling perf issues triage process

Different issues from hardware, kernel, or application level could cause the similar symptoms. For example, I used to handle the CPU high utilization bug caused by wrong MTRR(Memory Type Range Register) setting.

In another case, the scheduling domain workload imbalance was caused by a buggy ACPI SART table. In my examples, these issues might be easily identified by CPI(cycle per instruction) number reported by Linux perf or NUMAtop tools. However, if the problems comes from kernel or application, it can be very difficult to get the root cause, when we do not have the enough knowledge for that specific components.

As we always reported perf scheduling issues from specific type of workload or benchmark testing. The most efficient order
to triage scheduling performance bug is from top to bottom.

application -> kernel -> hypervisor -> hardware

One issues move from one layer to next layer, we must have technical justifications with following information,

The clear problem definitions with clear performance baseline

Why we think the problem is not in this layer

The performance tracing data or logs that support your analysis
查看全文

相关阅读:
The 2021 CCPC Guilin Onsite【A,I,G(二分),D（思维+构造）,E（最短路】
第 46 届 ICPC 国际大学生程序设计竞赛亚洲区域赛（沈阳）
The 2021 CCPC Weihai Onsite【G:组合数学（好模板） D:Exkmp利用Z数组跑后缀】
第 46 届 ICPC 国际大学生程序设计竞赛亚洲区域赛（上海）【I：线性DP】
项目中基于Rest的Wcf服务发布以及iBatisNet框架的使用(下)
linq to xml 学习整理
 项目中基于Rest的Wcf服务发布以及iBatisNet框架的使用(上)
java 工具PDF使用——itextpdf
在 jQuery 中弹出层中如何实现点击空白处关闭弹出层
 人生，有时候就是这样

原文地址：https://www.cnblogs.com/ainima/p/6330789.html

Linux调度器性能分析

1. Scheduling perf profiling goals

2. The major symptoms of scheduling perf issues

3. The scheduling perf issues triage process