ACPI / LPSS: Support for device latency tolerance PM QoS
[linux-block.git] / Documentation / power / pm_qos_interface.txt
CommitLineData
bf1db69f 1PM Quality Of Service Interface.
d82b3518
MG
2
3This interface provides a kernel and user mode interface for registering
4performance expectations by drivers, subsystems and user space applications on
5one of the parameters.
6
e3cba324
JP
7Two different PM QoS frameworks are available:
81. PM QoS classes for cpu_dma_latency, network_latency, network_throughput.
92. the per-device PM QoS framework provides the API to manage the per-device latency
d30b82a4 10constraints and PM QoS flags.
d82b3518 11
bf1db69f
RH
12Each parameters have defined units:
13 * latency: usec
14 * timeout: usec
15 * throughput: kbs (kilo bit / sec)
16
e3cba324
JP
17
181. PM QoS framework
19
d82b3518
MG
20The infrastructure exposes multiple misc device nodes one per implemented
21parameter. The set of parameters implement is defined by pm_qos_power_init()
22and pm_qos_params.h. This is done because having the available parameters
23being runtime configurable or changeable from a driver was seen as too easy to
24abuse.
25
ed77134b 26For each parameter a list of performance requests is maintained along with
d82b3518 27an aggregated target value. The aggregated target value is updated with
ed77134b
MG
28changes to the request list or elements of the list. Typically the
29aggregated target value is simply the max or min of the request values held
d82b3518 30in the parameter list elements.
e3cba324
JP
31Note: the aggregated target value is implemented as an atomic variable so that
32reading the aggregated value does not require any locking mechanism.
33
d82b3518
MG
34
35From kernel mode the use of this interface is simple:
d82b3518 36
e3cba324
JP
37void pm_qos_add_request(handle, param_class, target_value):
38Will insert an element into the list for that identified PM QoS class with the
ed77134b
MG
39target value. Upon change to this list the new target is recomputed and any
40registered notifiers are called only if the target value is now different.
e3cba324
JP
41Clients of pm_qos need to save the returned handle for future use in other
42pm_qos API functions.
d82b3518 43
ed77134b
MG
44void pm_qos_update_request(handle, new_target_value):
45Will update the list element pointed to by the handle with the new target value
46and recompute the new aggregated target, calling the notification tree if the
47target is changed.
48
49void pm_qos_remove_request(handle):
50Will remove the element. After removal it will update the aggregate target and
51call the notification tree if the target was changed as a result of removing
52the request.
d82b3518 53
e3cba324
JP
54int pm_qos_request(param_class):
55Returns the aggregated value for a given PM QoS class.
56
57int pm_qos_request_active(handle):
58Returns if the request is still active, i.e. it has not been removed from a
59PM QoS class constraints list.
60
61int pm_qos_add_notifier(param_class, notifier):
62Adds a notification callback function to the PM QoS class. The callback is
63called when the aggregated value for the PM QoS class is changed.
64
65int pm_qos_remove_notifier(int param_class, notifier):
66Removes the notification callback function for the PM QoS class.
67
d82b3518
MG
68
69From user mode:
ed77134b
MG
70Only processes can register a pm_qos request. To provide for automatic
71cleanup of a process, the interface requires the process to register its
72parameter requests in the following way:
d82b3518
MG
73
74To register the default pm_qos target for the specific parameter, the process
75must open one of /dev/[cpu_dma_latency, network_latency, network_throughput]
76
77As long as the device node is held open that process has a registered
ed77134b 78request on the parameter.
d82b3518 79
ed77134b
MG
80To change the requested target value the process needs to write an s32 value to
81the open device node. Alternatively the user mode program could write a hex
82string for the value using 10 char long format e.g. "0x12345678". This
83translates to a pm_qos_update_request call.
d82b3518
MG
84
85To remove the user mode request for a target value simply close the device
86node.
87
88
d30b82a4
T
892. PM QoS per-device latency and flags framework
90
2d984ad1
RW
91For each device, there are three lists of PM QoS requests. Two of them are
92maintained along with the aggregated targets of resume latency and active
93state latency tolerance (in microseconds) and the third one is for PM QoS flags.
94Values are updated in response to changes of the request list.
d30b82a4 95
2d984ad1
RW
96The target values of resume latency and active state latency tolerance are
97simply the minimum of the request values held in the parameter list elements.
98The PM QoS flags aggregate value is a gather (bitwise OR) of all list elements'
99values. Two device PM QoS flags are defined currently: PM_QOS_FLAG_NO_POWER_OFF
100and PM_QOS_FLAG_REMOTE_WAKEUP.
e3cba324 101
2d984ad1
RW
102Note: The aggregated target values are implemented in such a way that reading
103the aggregated value does not require any locking mechanism.
e3cba324
JP
104
105
106From kernel mode the use of this interface is the following:
107
ae0fb4b7 108int dev_pm_qos_add_request(device, handle, type, value):
e3cba324
JP
109Will insert an element into the list for that identified device with the
110target value. Upon change to this list the new target is recomputed and any
111registered notifiers are called only if the target value is now different.
112Clients of dev_pm_qos need to save the handle for future use in other
113dev_pm_qos API functions.
114
115int dev_pm_qos_update_request(handle, new_value):
116Will update the list element pointed to by the handle with the new target value
117and recompute the new aggregated target, calling the notification trees if the
118target is changed.
119
120int dev_pm_qos_remove_request(handle):
121Will remove the element. After removal it will update the aggregate target and
122call the notification trees if the target was changed as a result of removing
123the request.
124
125s32 dev_pm_qos_read_value(device):
126Returns the aggregated value for a given device's constraints list.
127
d30b82a4
T
128enum pm_qos_flags_status dev_pm_qos_flags(device, mask)
129Check PM QoS flags of the given device against the given mask of flags.
130The meaning of the return values is as follows:
131 PM_QOS_FLAGS_ALL: All flags from the mask are set
132 PM_QOS_FLAGS_SOME: Some flags from the mask are set
133 PM_QOS_FLAGS_NONE: No flags from the mask are set
134 PM_QOS_FLAGS_UNDEFINED: The device's PM QoS structure has not been
135 initialized or the list of requests is empty.
136
137int dev_pm_qos_add_ancestor_request(dev, handle, value)
138Add a PM QoS request for the first direct ancestor of the given device whose
139power.ignore_children flag is unset.
140
141int dev_pm_qos_expose_latency_limit(device, value)
b02f6695
RW
142Add a request to the device's PM QoS list of resume latency constraints and
143create a sysfs attribute pm_qos_resume_latency_us under the device's power
144directory allowing user space to manipulate that request.
d30b82a4
T
145
146void dev_pm_qos_hide_latency_limit(device)
147Drop the request added by dev_pm_qos_expose_latency_limit() from the device's
b02f6695
RW
148PM QoS list of resume latency constraints and remove sysfs attribute
149pm_qos_resume_latency_us from the device's power directory.
d30b82a4
T
150
151int dev_pm_qos_expose_flags(device, value)
152Add a request to the device's PM QoS list of flags and create sysfs attributes
153pm_qos_no_power_off and pm_qos_remote_wakeup under the device's power directory
154allowing user space to change these flags' value.
155
156void dev_pm_qos_hide_flags(device)
157Drop the request added by dev_pm_qos_expose_flags() from the device's PM QoS list
158of flags and remove sysfs attributes pm_qos_no_power_off and pm_qos_remote_wakeup
159under the device's power directory.
e3cba324
JP
160
161Notification mechanisms:
162The per-device PM QoS framework has 2 different and distinct notification trees:
163a per-device notification tree and a global notification tree.
164
165int dev_pm_qos_add_notifier(device, notifier):
166Adds a notification callback function for the device.
167The callback is called when the aggregated value of the device constraints list
b02f6695 168is changed (for resume latency device PM QoS only).
e3cba324
JP
169
170int dev_pm_qos_remove_notifier(device, notifier):
171Removes the notification callback function for the device.
172
173int dev_pm_qos_add_global_notifier(notifier):
174Adds a notification callback function in the global notification tree of the
175framework.
b02f6695
RW
176The callback is called when the aggregated value for any device is changed
177(for resume latency device PM QoS only).
e3cba324
JP
178
179int dev_pm_qos_remove_global_notifier(notifier):
180Removes the notification callback function from the global notification tree
181of the framework.
2d984ad1
RW
182
183
184Active state latency tolerance
185
186This device PM QoS type is used to support systems in which hardware may switch
187to energy-saving operation modes on the fly. In those systems, if the operation
188mode chosen by the hardware attempts to save energy in an overly aggressive way,
189it may cause excess latencies to be visible to software, causing it to miss
190certain protocol requirements or target frame or sample rates etc.
191
192If there is a latency tolerance control mechanism for a given device available
193to software, the .set_latency_tolerance callback in that device's dev_pm_info
194structure should be populated. The routine pointed to by it is should implement
195whatever is necessary to transfer the effective requirement value to the
196hardware.
197
198Whenever the effective latency tolerance changes for the device, its
199.set_latency_tolerance() callback will be executed and the effective value will
200be passed to it. If that value is negative, which means that the list of
201latency tolerance requirements for the device is empty, the callback is expected
202to switch the underlying hardware latency tolerance control mechanism to an
203autonomous mode if available. If that value is PM_QOS_LATENCY_ANY, in turn, and
204the hardware supports a special "no requirement" setting, the callback is
205expected to use it. That allows software to prevent the hardware from
206automatically updating the device's latency tolerance in response to its power
207state changes (e.g. during transitions from D3cold to D0), which generally may
208be done in the autonomous latency tolerance control mode.
209
210If .set_latency_tolerance() is present for the device, sysfs attribute
211pm_qos_latency_tolerance_us will be present in the devivce's power directory.
212Then, user space can use that attribute to specify its latency tolerance
213requirement for the device, if any. Writing "any" to it means "no requirement,
214but do not let the hardware control latency tolerance" and writing "auto" to it
215allows the hardware to be switched to the autonomous mode if there are no other
216requirements from the kernel side in the device's list.
217
218Kernel code can use the functions described above along with the
219DEV_PM_QOS_LATENCY_TOLERANCE device PM QoS type to add, remove and update
220latency tolerance requirements for devices.