crc/test: fix potential div by zero
[fio.git] / fio.1
CommitLineData
65f3c785 1.TH fio 1 "October 2013" "User Manual"
d60e92d1
AC
2.SH NAME
3fio \- flexible I/O tester
4.SH SYNOPSIS
5.B fio
6[\fIoptions\fR] [\fIjobfile\fR]...
7.SH DESCRIPTION
8.B fio
9is a tool that will spawn a number of threads or processes doing a
10particular type of I/O action as specified by the user.
11The typical use of fio is to write a job file matching the I/O load
12one wants to simulate.
13.SH OPTIONS
14.TP
49da1240
JA
15.BI \-\-debug \fR=\fPtype
16Enable verbose tracing of various fio actions. May be `all' for all types
17or individual types separated by a comma (eg \-\-debug=io,file). `help' will
18list all available tracing options.
19.TP
d60e92d1
AC
20.BI \-\-output \fR=\fPfilename
21Write output to \fIfilename\fR.
22.TP
b2cecdc2 23.BI \-\-runtime \fR=\fPruntime
24Limit run time to \fIruntime\fR seconds.
d60e92d1
AC
25.TP
26.B \-\-latency\-log
27Generate per-job latency logs.
28.TP
29.B \-\-bandwidth\-log
30Generate per-job bandwidth logs.
31.TP
32.B \-\-minimal
d1429b5c 33Print statistics in a terse, semicolon-delimited format.
d60e92d1 34.TP
2b8c71b0
CE
35.B \-\-append-terse
36Print statistics in selected mode AND terse, semicolon-delimited format.
37.TP
49da1240
JA
38.B \-\-version
39Display version information and exit.
40.TP
065248bf 41.BI \-\-terse\-version \fR=\fPversion
4d658652 42Set terse version output format (Current version 3, or older version 2).
49da1240
JA
43.TP
44.B \-\-help
45Display usage information and exit.
46.TP
fec0f21c
JA
47.B \-\-cpuclock-test
48Perform test and validation of internal CPU clock
49.TP
50.BI \-\-crctest[\fR=\fPtest]
51Test the speed of the builtin checksumming functions. If no argument is given,
52all of them are tested. Or a comma separated list can be passed, in which
53case the given ones are tested.
54.TP
49da1240
JA
55.BI \-\-cmdhelp \fR=\fPcommand
56Print help information for \fIcommand\fR. May be `all' for all commands.
57.TP
de890a1e
SL
58.BI \-\-enghelp \fR=\fPioengine[,command]
59List all commands defined by \fIioengine\fR, or print help for \fIcommand\fR defined by \fIioengine\fR.
60.TP
d60e92d1
AC
61.BI \-\-showcmd \fR=\fPjobfile
62Convert \fIjobfile\fR to a set of command-line options.
63.TP
d60e92d1
AC
64.BI \-\-eta \fR=\fPwhen
65Specifies when real-time ETA estimate should be printed. \fIwhen\fR may
66be one of `always', `never' or `auto'.
67.TP
30b5d57f
JA
68.BI \-\-eta\-newline \fR=\fPtime
69Force an ETA newline for every `time` period passed.
70.TP
71.BI \-\-status\-interval \fR=\fPtime
72Report full output status every `time` period passed.
73.TP
49da1240
JA
74.BI \-\-readonly
75Turn on safety read-only checks, preventing any attempted write.
76.TP
c0a5d35e 77.BI \-\-section \fR=\fPsec
cf145d90 78Only run section \fIsec\fR from job file. This option can be used multiple times to add more sections to run.
c0a5d35e 79.TP
49da1240
JA
80.BI \-\-alloc\-size \fR=\fPkb
81Set the internal smalloc pool size to \fIkb\fP kilobytes.
d60e92d1 82.TP
49da1240
JA
83.BI \-\-warnings\-fatal
84All fio parser warnings are fatal, causing fio to exit with an error.
9183788d 85.TP
49da1240 86.BI \-\-max\-jobs \fR=\fPnr
57e118a2 87Set the maximum allowed number of jobs (threads/processes) to support.
d60e92d1 88.TP
49da1240
JA
89.BI \-\-server \fR=\fPargs
90Start a backend server, with \fIargs\fP specifying what to listen to. See client/server section.
f57a9c59 91.TP
49da1240
JA
92.BI \-\-daemonize \fR=\fPpidfile
93Background a fio server, writing the pid to the given pid file.
94.TP
95.BI \-\-client \fR=\fPhost
96Instead of running the jobs locally, send and run them on the given host.
f2a2ce0e
HL
97.TP
98.BI \-\-idle\-prof \fR=\fPoption
99Report cpu idleness on a system or percpu basis (\fIoption\fP=system,percpu) or run unit work calibration only (\fIoption\fP=calibrate).
d60e92d1
AC
100.SH "JOB FILE FORMAT"
101Job files are in `ini' format. They consist of one or more
102job definitions, which begin with a job name in square brackets and
103extend to the next job name. The job name can be any ASCII string
104except `global', which has a special meaning. Following the job name is
105a sequence of zero or more parameters, one per line, that define the
106behavior of the job. Any line starting with a `;' or `#' character is
d1429b5c 107considered a comment and ignored.
d9956b64
AC
108.P
109If \fIjobfile\fR is specified as `-', the job file will be read from
110standard input.
d60e92d1
AC
111.SS "Global Section"
112The global section contains default parameters for jobs specified in the
113job file. A job is only affected by global sections residing above it,
114and there may be any number of global sections. Specific job definitions
115may override any parameter set in global sections.
116.SH "JOB PARAMETERS"
117.SS Types
b470a02c
SC
118Some parameters may take arguments of a specific type.
119Anywhere a numeric value is required, an arithmetic expression may be used,
d59aa780
JA
120provided it is surrounded by parentheses. Supported operators are:
121.RS
122.RS
123.TP
124.B addition (+)
125.TP
126.B subtraction (-)
127.TP
128.B multiplication (*)
129.TP
130.B division (/)
131.TP
132.B modulus (%)
133.TP
134.B exponentiation (^)
135.RE
136.RE
137.P
138For time values in expressions, units are microseconds by default. This is
139different than for time values not in expressions (not enclosed in
140parentheses). The types used are:
d60e92d1
AC
141.TP
142.I str
143String: a sequence of alphanumeric characters.
144.TP
145.I int
d60e92d1 146SI integer: a whole number, possibly containing a suffix denoting the base unit
b09da8fa
JA
147of the value. Accepted suffixes are `k', 'M', 'G', 'T', and 'P', denoting
148kilo (1024), mega (1024^2), giga (1024^3), tera (1024^4), and peta (1024^5)
74454ce4
CE
149respectively. If prefixed with '0x', the value is assumed to be base 16
150(hexadecimal). A suffix may include a trailing 'b', for instance 'kb' is
151identical to 'k'. You can specify a base 10 value by using 'KiB', 'MiB','GiB',
152etc. This is useful for disk drives where values are often given in base 10
153values. Specifying '30GiB' will get you 30*1000^3 bytes.
154When specifying times the default suffix meaning changes, still denoting the
155base unit of the value, but accepted suffixes are 'D' (days), 'H' (hours), 'M'
0de5b26f
JA
156(minutes), 'S' Seconds, 'ms' (or msec) milli seconds, 'us' (or 'usec') micro
157seconds. Time values without a unit specify seconds.
74454ce4 158The suffixes are not case sensitive.
d60e92d1
AC
159.TP
160.I bool
161Boolean: a true or false value. `0' denotes false, `1' denotes true.
162.TP
163.I irange
164Integer range: a range of integers specified in the format
d1429b5c
AC
165\fIlower\fR:\fIupper\fR or \fIlower\fR\-\fIupper\fR. \fIlower\fR and
166\fIupper\fR may contain a suffix as described above. If an option allows two
167sets of ranges, they are separated with a `,' or `/' character. For example:
168`8\-8k/8M\-4G'.
83349190
YH
169.TP
170.I float_list
171List of floating numbers: A list of floating numbers, separated by
cecbfd47 172a ':' character.
d60e92d1
AC
173.SS "Parameter List"
174.TP
175.BI name \fR=\fPstr
d9956b64 176May be used to override the job name. On the command line, this parameter
d60e92d1
AC
177has the special purpose of signalling the start of a new job.
178.TP
179.BI description \fR=\fPstr
180Human-readable description of the job. It is printed when the job is run, but
181otherwise has no special purpose.
182.TP
183.BI directory \fR=\fPstr
184Prefix filenames with this directory. Used to place files in a location other
185than `./'.
bcbfeefa
CE
186You can specify a number of directories by separating the names with a ':'
187character. These directories will be assigned equally distributed to job clones
188creates with \fInumjobs\fR as long as they are using generated filenames.
189If specific \fIfilename(s)\fR are set fio will use the first listed directory,
190and thereby matching the \fIfilename\fR semantic which generates a file each
67445b63
JA
191clone if not specified, but let all clones use the same if set. See
192\fIfilename\fR for considerations regarding escaping certain characters on
193some platforms.
d60e92d1
AC
194.TP
195.BI filename \fR=\fPstr
196.B fio
197normally makes up a file name based on the job name, thread number, and file
d1429b5c 198number. If you want to share files between threads in a job or several jobs,
de890a1e
SL
199specify a \fIfilename\fR for each of them to override the default.
200If the I/O engine is file-based, you can specify
d1429b5c
AC
201a number of files by separating the names with a `:' character. `\-' is a
202reserved name, meaning stdin or stdout, depending on the read/write direction
67445b63
JA
203set. On Windows, disk devices are accessed as \\.\PhysicalDrive0 for the first
204device, \\.\PhysicalDrive1 for the second etc. Note: Windows and FreeBSD
205prevent write access to areas of the disk containing in-use data
206(e.g. filesystems). If the wanted filename does need to include a colon, then
4904acd5
JM
207escape that with a '\\' character. For instance, if the filename is
208"/dev/dsk/foo@3,0:c", then you would use filename="/dev/dsk/foo@3,0\\:c".
d60e92d1 209.TP
de98bd30 210.BI filename_format \fR=\fPstr
ce594fbe 211If sharing multiple files between jobs, it is usually necessary to have
de98bd30
JA
212fio generate the exact names that you want. By default, fio will name a file
213based on the default file format specification of
214\fBjobname.jobnumber.filenumber\fP. With this option, that can be
215customized. Fio will recognize and replace the following keywords in this
216string:
217.RS
218.RS
219.TP
220.B $jobname
221The name of the worker thread or process.
222.TP
223.B $jobnum
224The incremental number of the worker thread or process.
225.TP
226.B $filenum
227The incremental number of the file for that worker thread or process.
228.RE
229.P
230To have dependent jobs share a set of files, this option can be set to
231have fio generate filenames that are shared between the two. For instance,
232if \fBtestfiles.$filenum\fR is specified, file number 4 for any job will
233be named \fBtestfiles.4\fR. The default of \fB$jobname.$jobnum.$filenum\fR
234will be used if no other format specifier is given.
235.RE
236.P
237.TP
3ce9dcaf
JA
238.BI lockfile \fR=\fPstr
239Fio defaults to not locking any files before it does IO to them. If a file or
240file descriptor is shared, fio can serialize IO to that file to make the end
241result consistent. This is usual for emulating real workloads that share files.
242The lock modes are:
243.RS
244.RS
245.TP
246.B none
247No locking. This is the default.
248.TP
249.B exclusive
cf145d90 250Only one thread or process may do IO at a time, excluding all others.
3ce9dcaf
JA
251.TP
252.B readwrite
253Read-write locking on the file. Many readers may access the file at the same
254time, but writes get exclusive access.
255.RE
ce594fbe 256.RE
3ce9dcaf 257.P
d60e92d1
AC
258.BI opendir \fR=\fPstr
259Recursively open any files below directory \fIstr\fR.
260.TP
261.BI readwrite \fR=\fPstr "\fR,\fP rw" \fR=\fPstr
262Type of I/O pattern. Accepted values are:
263.RS
264.RS
265.TP
266.B read
d1429b5c 267Sequential reads.
d60e92d1
AC
268.TP
269.B write
d1429b5c 270Sequential writes.
d60e92d1 271.TP
fa769d44
SW
272.B trim
273Sequential trim (Linux block devices only).
274.TP
d60e92d1 275.B randread
d1429b5c 276Random reads.
d60e92d1
AC
277.TP
278.B randwrite
d1429b5c 279Random writes.
d60e92d1 280.TP
fa769d44
SW
281.B randtrim
282Random trim (Linux block devices only).
283.TP
10b023db 284.B rw, readwrite
d1429b5c 285Mixed sequential reads and writes.
d60e92d1
AC
286.TP
287.B randrw
d1429b5c 288Mixed random reads and writes.
d60e92d1
AC
289.RE
290.P
38dad62d
JA
291For mixed I/O, the default split is 50/50. For certain types of io the result
292may still be skewed a bit, since the speed may be different. It is possible to
3b7fa9ec 293specify a number of IO's to do before getting a new offset, this is done by
38dad62d
JA
294appending a `:\fI<nr>\fR to the end of the string given. For a random read, it
295would look like \fBrw=randread:8\fR for passing in an offset modifier with a
059b0802
JA
296value of 8. If the postfix is used with a sequential IO pattern, then the value
297specified will be added to the generated offset for each IO. For instance,
298using \fBrw=write:4k\fR will skip 4k for every write. It turns sequential IO
299into sequential IO with holes. See the \fBrw_sequencer\fR option.
d60e92d1
AC
300.RE
301.TP
38dad62d
JA
302.BI rw_sequencer \fR=\fPstr
303If an offset modifier is given by appending a number to the \fBrw=<str>\fR line,
304then this option controls how that number modifies the IO offset being
305generated. Accepted values are:
306.RS
307.RS
308.TP
309.B sequential
310Generate sequential offset
311.TP
312.B identical
313Generate the same offset
314.RE
315.P
316\fBsequential\fR is only useful for random IO, where fio would normally
317generate a new random offset for every IO. If you append eg 8 to randread, you
318would get a new random offset for every 8 IO's. The result would be a seek for
319only every 8 IO's, instead of for every IO. Use \fBrw=randread:8\fR to specify
320that. As sequential IO is already sequential, setting \fBsequential\fR for that
321would not result in any differences. \fBidentical\fR behaves in a similar
322fashion, except it sends the same offset 8 number of times before generating a
323new offset.
324.RE
325.P
326.TP
90fef2d1
JA
327.BI kb_base \fR=\fPint
328The base unit for a kilobyte. The defacto base is 2^10, 1024. Storage
329manufacturers like to use 10^3 or 1000 as a base ten unit instead, for obvious
5c9323fb 330reasons. Allowed values are 1024 or 1000, with 1024 being the default.
90fef2d1 331.TP
771e58be
JA
332.BI unified_rw_reporting \fR=\fPbool
333Fio normally reports statistics on a per data direction basis, meaning that
334read, write, and trim are accounted and reported separately. If this option is
cf145d90 335set fio sums the results and reports them as "mixed" instead.
771e58be 336.TP
d60e92d1 337.BI randrepeat \fR=\fPbool
56e2a5fc
CE
338Seed the random number generator used for random I/O patterns in a predictable
339way so the pattern is repeatable across runs. Default: true.
340.TP
341.BI allrandrepeat \fR=\fPbool
342Seed all random number generators in a predictable way so results are
343repeatable across runs. Default: false.
d60e92d1 344.TP
04778baf
JA
345.BI randseed \fR=\fPint
346Seed the random number generators based on this seed value, to be able to
347control what sequence of output is being generated. If not set, the random
348sequence depends on the \fBrandrepeat\fR setting.
349.TP
2615cc4b 350.BI use_os_rand \fR=\fPbool
cf145d90
CVB
351Fio can either use the random generator supplied by the OS to generate random
352offsets, or it can use its own internal generator (based on Tausworthe).
2615cc4b
JA
353Default is to use the internal generator, which is often of better quality and
354faster. Default: false.
355.TP
a596f047
EG
356.BI fallocate \fR=\fPstr
357Whether pre-allocation is performed when laying down files. Accepted values
358are:
359.RS
360.RS
361.TP
362.B none
363Do not pre-allocate space.
364.TP
365.B posix
ccc2b328 366Pre-allocate via \fBposix_fallocate\fR\|(3).
a596f047
EG
367.TP
368.B keep
ccc2b328 369Pre-allocate via \fBfallocate\fR\|(2) with FALLOC_FL_KEEP_SIZE set.
a596f047
EG
370.TP
371.B 0
372Backward-compatible alias for 'none'.
373.TP
374.B 1
375Backward-compatible alias for 'posix'.
376.RE
377.P
378May not be available on all supported platforms. 'keep' is only
379available on Linux. If using ZFS on Solaris this must be set to 'none'
380because ZFS doesn't support it. Default: 'posix'.
381.RE
7bc8c2cf 382.TP
d60e92d1 383.BI fadvise_hint \fR=\fPbool
cf145d90 384Use \fBposix_fadvise\fR\|(2) to advise the kernel what I/O patterns
d1429b5c 385are likely to be issued. Default: true.
d60e92d1 386.TP
f7fa2653 387.BI size \fR=\fPint
d60e92d1 388Total size of I/O for this job. \fBfio\fR will run until this many bytes have
ca45881f 389been transferred, unless limited by other options (\fBruntime\fR, for instance).
d7c8be03 390Unless \fBnrfiles\fR and \fBfilesize\fR options are given, this amount will be
d6667268 391divided between the available files for the job. If not set, fio will use the
cecbfd47 392full size of the given files or devices. If the files do not exist, size
7bb59102 393must be given. It is also possible to give size as a percentage between 1 and
77731b29
JA
394100. If size=20% is given, fio will use 20% of the full size of the given
395files or devices.
396.TP
397.BI io_limit \fR=\fPint
398Normally fio operates within the region set by \fBsize\fR, which means that
399the \fBsize\fR option sets both the region and size of IO to be performed.
400Sometimes that is not what you want. With this option, it is possible to
401define just the amount of IO that fio should do. For instance, if \fBsize\fR
402is set to 20G and \fBio_limit\fR is set to 5G, fio will perform IO within
403the first 20G but exit when 5G have been done.
d60e92d1 404.TP
74586c1e 405.BI fill_device \fR=\fPbool "\fR,\fB fill_fs" \fR=\fPbool
3ce9dcaf
JA
406Sets size to something really large and waits for ENOSPC (no space left on
407device) as the terminating condition. Only makes sense with sequential write.
408For a read workload, the mount point will be filled first then IO started on
4f12432e
JA
409the result. This option doesn't make sense if operating on a raw device node,
410since the size of that is already known by the file system. Additionally,
411writing beyond end-of-device will not return ENOSPC there.
3ce9dcaf 412.TP
d60e92d1
AC
413.BI filesize \fR=\fPirange
414Individual file sizes. May be a range, in which case \fBfio\fR will select sizes
d1429b5c
AC
415for files at random within the given range, limited to \fBsize\fR in total (if
416that is given). If \fBfilesize\fR is not specified, each created file is the
417same size.
d60e92d1 418.TP
bedc9dc2
JA
419.BI file_append \fR=\fPbool
420Perform IO after the end of the file. Normally fio will operate within the
421size of a file. If this option is set, then fio will append to the file
422instead. This has identical behavior to setting \fRoffset\fP to the size
0aae4ce7 423of a file. This option is ignored on non-regular files.
bedc9dc2 424.TP
f7fa2653 425.BI blocksize \fR=\fPint[,int] "\fR,\fB bs" \fR=\fPint[,int]
d9472271
JA
426Block size for I/O units. Default: 4k. Values for reads, writes, and trims
427can be specified separately in the format \fIread\fR,\fIwrite\fR,\fItrim\fR
428either of which may be empty to leave that value at its default. If a trailing
429comma isn't given, the remainder will inherit the last value set.
d60e92d1 430.TP
9183788d 431.BI blocksize_range \fR=\fPirange[,irange] "\fR,\fB bsrange" \fR=\fPirange[,irange]
d1429b5c
AC
432Specify a range of I/O block sizes. The issued I/O unit will always be a
433multiple of the minimum size, unless \fBblocksize_unaligned\fR is set. Applies
9183788d 434to both reads and writes if only one range is given, but can be specified
de8f6de9 435separately with a comma separating the values. Example: bsrange=1k-4k,2k-8k.
9183788d
JA
436Also (see \fBblocksize\fR).
437.TP
438.BI bssplit \fR=\fPstr
439This option allows even finer grained control of the block sizes issued,
440not just even splits between them. With this option, you can weight various
441block sizes for exact control of the issued IO for a job that has mixed
442block sizes. The format of the option is bssplit=blocksize/percentage,
5982a925 443optionally adding as many definitions as needed separated by a colon.
9183788d 444Example: bssplit=4k/10:64k/50:32k/40 would issue 50% 64k blocks, 10% 4k
c83cdd3e
JA
445blocks and 40% 32k blocks. \fBbssplit\fR also supports giving separate
446splits to reads and writes. The format is identical to what the
447\fBbs\fR option accepts, the read and write parts are separated with a
448comma.
d60e92d1
AC
449.TP
450.B blocksize_unaligned\fR,\fP bs_unaligned
d1429b5c
AC
451If set, any size in \fBblocksize_range\fR may be used. This typically won't
452work with direct I/O, as that normally requires sector alignment.
d60e92d1 453.TP
2b7a01d0 454.BI blockalign \fR=\fPint[,int] "\fR,\fB ba" \fR=\fPint[,int]
639ce0f3
MS
455At what boundary to align random IO offsets. Defaults to the same as 'blocksize'
456the minimum blocksize given. Minimum alignment is typically 512b
2b7a01d0
JA
457for using direct IO, though it usually depends on the hardware block size.
458This option is mutually exclusive with using a random map for files, so it
459will turn off that option.
43602667 460.TP
6aca9b3d
JA
461.BI bs_is_seq_rand \fR=\fPbool
462If this option is set, fio will use the normal read,write blocksize settings as
463sequential,random instead. Any random read or write will use the WRITE
464blocksize settings, and any sequential read or write will use the READ
465blocksize setting.
466.TP
d60e92d1 467.B zero_buffers
cf145d90 468Initialize buffers with all zeros. Default: fill buffers with random data.
7750aac4
JA
469The resulting IO buffers will not be completely zeroed, unless
470\fPscramble_buffers\fR is also turned off.
d60e92d1 471.TP
901bb994
JA
472.B refill_buffers
473If this option is given, fio will refill the IO buffers on every submit. The
474default is to only fill it at init time and reuse that data. Only makes sense
475if zero_buffers isn't specified, naturally. If data verification is enabled,
476refill_buffers is also automatically enabled.
477.TP
fd68418e
JA
478.BI scramble_buffers \fR=\fPbool
479If \fBrefill_buffers\fR is too costly and the target is using data
480deduplication, then setting this option will slightly modify the IO buffer
481contents to defeat normal de-dupe attempts. This is not enough to defeat
482more clever block compression attempts, but it will stop naive dedupe
483of blocks. Default: true.
484.TP
c5751c62
JA
485.BI buffer_compress_percentage \fR=\fPint
486If this is set, then fio will attempt to provide IO buffer content (on WRITEs)
487that compress to the specified level. Fio does this by providing a mix of
488random data and zeroes. Note that this is per block size unit, for file/disk
489wide compression level that matches this setting, you'll also want to set
490\fBrefill_buffers\fR.
491.TP
492.BI buffer_compress_chunk \fR=\fPint
493See \fBbuffer_compress_percentage\fR. This setting allows fio to manage how
494big the ranges of random data and zeroed data is. Without this set, fio will
495provide \fBbuffer_compress_percentage\fR of blocksize random data, followed by
496the remaining zeroed. With this set to some chunk size smaller than the block
497size, fio can alternate random and zeroed data throughout the IO buffer.
498.TP
ce35b1ec 499.BI buffer_pattern \fR=\fPstr
cf145d90
CVB
500If set, fio will fill the IO buffers with this pattern. If not set, the contents
501of IO buffers is defined by the other options related to buffer contents. The
ce35b1ec 502setting can be any pattern of bytes, and can be prefixed with 0x for hex
02975b64
JA
503values. It may also be a string, where the string must then be wrapped with
504"".
ce35b1ec 505.TP
5c94b008
JA
506.BI dedupe_percentage \fR=\fPint
507If set, fio will generate this percentage of identical buffers when writing.
508These buffers will be naturally dedupable. The contents of the buffers depend
509on what other buffer compression settings have been set. It's possible to have
510the individual buffers either fully compressible, or not at all. This option
511only controls the distribution of unique buffers.
512.TP
d60e92d1
AC
513.BI nrfiles \fR=\fPint
514Number of files to use for this job. Default: 1.
515.TP
516.BI openfiles \fR=\fPint
517Number of files to keep open at the same time. Default: \fBnrfiles\fR.
518.TP
519.BI file_service_type \fR=\fPstr
520Defines how files to service are selected. The following types are defined:
521.RS
522.RS
523.TP
524.B random
5c9323fb 525Choose a file at random.
d60e92d1
AC
526.TP
527.B roundrobin
cf145d90 528Round robin over opened files (default).
5c9323fb 529.TP
6b7f6851
JA
530.B sequential
531Do each file in the set sequentially.
d60e92d1
AC
532.RE
533.P
cf145d90 534The number of I/Os to issue before switching to a new file can be specified by
d60e92d1
AC
535appending `:\fIint\fR' to the service type.
536.RE
537.TP
538.BI ioengine \fR=\fPstr
539Defines how the job issues I/O. The following types are defined:
540.RS
541.RS
542.TP
543.B sync
ccc2b328 544Basic \fBread\fR\|(2) or \fBwrite\fR\|(2) I/O. \fBfseek\fR\|(2) is used to
d60e92d1
AC
545position the I/O location.
546.TP
a31041ea 547.B psync
ccc2b328 548Basic \fBpread\fR\|(2) or \fBpwrite\fR\|(2) I/O.
a31041ea 549.TP
9183788d 550.B vsync
ccc2b328 551Basic \fBreadv\fR\|(2) or \fBwritev\fR\|(2) I/O. Will emulate queuing by
cecbfd47 552coalescing adjacent IOs into a single submission.
9183788d 553.TP
a46c5e01 554.B pvsync
ccc2b328 555Basic \fBpreadv\fR\|(2) or \fBpwritev\fR\|(2) I/O.
a46c5e01 556.TP
d60e92d1 557.B libaio
de890a1e 558Linux native asynchronous I/O. This ioengine defines engine specific options.
d60e92d1
AC
559.TP
560.B posixaio
ccc2b328 561POSIX asynchronous I/O using \fBaio_read\fR\|(3) and \fBaio_write\fR\|(3).
03e20d68
BC
562.TP
563.B solarisaio
564Solaris native asynchronous I/O.
565.TP
566.B windowsaio
567Windows native asynchronous I/O.
d60e92d1
AC
568.TP
569.B mmap
ccc2b328
SW
570File is memory mapped with \fBmmap\fR\|(2) and data copied using
571\fBmemcpy\fR\|(3).
d60e92d1
AC
572.TP
573.B splice
ccc2b328 574\fBsplice\fR\|(2) is used to transfer the data and \fBvmsplice\fR\|(2) to
d1429b5c 575transfer data from user-space to the kernel.
d60e92d1
AC
576.TP
577.B syslet-rw
578Use the syslet system calls to make regular read/write asynchronous.
579.TP
580.B sg
581SCSI generic sg v3 I/O. May be either synchronous using the SG_IO ioctl, or if
ccc2b328
SW
582the target is an sg character device, we use \fBread\fR\|(2) and
583\fBwrite\fR\|(2) for asynchronous I/O.
d60e92d1
AC
584.TP
585.B null
586Doesn't transfer any data, just pretends to. Mainly used to exercise \fBfio\fR
587itself and for debugging and testing purposes.
588.TP
589.B net
de890a1e
SL
590Transfer over the network. The protocol to be used can be defined with the
591\fBprotocol\fR parameter. Depending on the protocol, \fBfilename\fR,
592\fBhostname\fR, \fBport\fR, or \fBlisten\fR must be specified.
593This ioengine defines engine specific options.
d60e92d1
AC
594.TP
595.B netsplice
ccc2b328 596Like \fBnet\fR, but uses \fBsplice\fR\|(2) and \fBvmsplice\fR\|(2) to map data
de890a1e 597and send/receive. This ioengine defines engine specific options.
d60e92d1 598.TP
53aec0a4 599.B cpuio
d60e92d1
AC
600Doesn't transfer any data, but burns CPU cycles according to \fBcpuload\fR and
601\fBcpucycles\fR parameters.
602.TP
603.B guasi
604The GUASI I/O engine is the Generic Userspace Asynchronous Syscall Interface
cecbfd47 605approach to asynchronous I/O.
d1429b5c
AC
606.br
607See <http://www.xmailserver.org/guasi\-lib.html>.
d60e92d1 608.TP
21b8aee8 609.B rdma
85286c5c
BVA
610The RDMA I/O engine supports both RDMA memory semantics (RDMA_WRITE/RDMA_READ)
611and channel semantics (Send/Recv) for the InfiniBand, RoCE and iWARP protocols.
21b8aee8 612.TP
d60e92d1
AC
613.B external
614Loads an external I/O engine object file. Append the engine filename as
615`:\fIenginepath\fR'.
d54fce84
DM
616.TP
617.B falloc
cecbfd47 618 IO engine that does regular linux native fallocate call to simulate data
d54fce84
DM
619transfer as fio ioengine
620.br
621 DDIR_READ does fallocate(,mode = FALLOC_FL_KEEP_SIZE,)
622.br
0981fd71 623 DIR_WRITE does fallocate(,mode = 0)
d54fce84
DM
624.br
625 DDIR_TRIM does fallocate(,mode = FALLOC_FL_KEEP_SIZE|FALLOC_FL_PUNCH_HOLE)
626.TP
627.B e4defrag
628IO engine that does regular EXT4_IOC_MOVE_EXT ioctls to simulate defragment activity
629request to DDIR_WRITE event
0d978694
DAG
630.TP
631.B rbd
632IO engine supporting direct access to Ceph Rados Block Devices (RBD) via librbd
633without the need to use the kernel rbd driver. This ioengine defines engine specific
634options.
a7c386f4 635.TP
636.B gfapi
cc47f094 637Using Glusterfs libgfapi sync interface to direct access to Glusterfs volumes without
638having to go through FUSE. This ioengine defines engine specific
639options.
640.TP
641.B gfapi_async
642Using Glusterfs libgfapi async interface to direct access to Glusterfs volumes without
a7c386f4 643having to go through FUSE. This ioengine defines engine specific
644options.
1b10477b 645.TP
b74e419e
MM
646.B libhdfs
647Read and write through Hadoop (HDFS). The \fBfilename\fR option is used to
648specify host,port of the hdfs name-node to connect. This engine interprets
649offsets a little differently. In HDFS, files once created cannot be modified.
650So random writes are not possible. To imitate this, libhdfs engine expects
651bunch of small files to be created over HDFS, and engine will randomly pick a
652file out of those files based on the offset generated by fio backend. (see the
653example job file to create such files, use rw=write option). Please note, you
654might want to set necessary environment variables to work with hdfs/libhdfs
655properly.
d60e92d1 656.RE
595e1734 657.P
d60e92d1
AC
658.RE
659.TP
660.BI iodepth \fR=\fPint
8489dae4
SK
661Number of I/O units to keep in flight against the file. Note that increasing
662iodepth beyond 1 will not affect synchronous ioengines (except for small
cf145d90 663degress when verify_async is in use). Even async engines may impose OS
ee72ca09
JA
664restrictions causing the desired depth not to be achieved. This may happen on
665Linux when using libaio and not setting \fBdirect\fR=1, since buffered IO is
666not async on that OS. Keep an eye on the IO depth distribution in the
667fio output to verify that the achieved depth is as expected. Default: 1.
d60e92d1
AC
668.TP
669.BI iodepth_batch \fR=\fPint
670Number of I/Os to submit at once. Default: \fBiodepth\fR.
671.TP
3ce9dcaf
JA
672.BI iodepth_batch_complete \fR=\fPint
673This defines how many pieces of IO to retrieve at once. It defaults to 1 which
674 means that we'll ask for a minimum of 1 IO in the retrieval process from the
675kernel. The IO retrieval will go on until we hit the limit set by
676\fBiodepth_low\fR. If this variable is set to 0, then fio will always check for
677completed events before queuing more IO. This helps reduce IO latency, at the
678cost of more retrieval system calls.
679.TP
d60e92d1
AC
680.BI iodepth_low \fR=\fPint
681Low watermark indicating when to start filling the queue again. Default:
682\fBiodepth\fR.
683.TP
684.BI direct \fR=\fPbool
685If true, use non-buffered I/O (usually O_DIRECT). Default: false.
686.TP
d01612f3
CM
687.BI atomic \fR=\fPbool
688If value is true, attempt to use atomic direct IO. Atomic writes are guaranteed
689to be stable once acknowledged by the operating system. Only Linux supports
690O_ATOMIC right now.
691.TP
d60e92d1
AC
692.BI buffered \fR=\fPbool
693If true, use buffered I/O. This is the opposite of the \fBdirect\fR parameter.
694Default: true.
695.TP
f7fa2653 696.BI offset \fR=\fPint
d60e92d1
AC
697Offset in the file to start I/O. Data before the offset will not be touched.
698.TP
591e9e06
JA
699.BI offset_increment \fR=\fPint
700If this is provided, then the real offset becomes the
69bdd6ba
JH
701offset + offset_increment * thread_number, where the thread number is a
702counter that starts at 0 and is incremented for each sub-job (i.e. when
703numjobs option is specified). This option is useful if there are several jobs
704which are intended to operate on a file in parallel disjoint segments, with
705even spacing between the starting points.
591e9e06 706.TP
ddf24e42
JA
707.BI number_ios \fR=\fPint
708Fio will normally perform IOs until it has exhausted the size of the region
709set by \fBsize\fR, or if it exhaust the allocated time (or hits an error
710condition). With this setting, the range/size can be set independently of
711the number of IOs to perform. When fio reaches this number, it will exit
be3fec7d
JA
712normally and report status. Note that this does not extend the amount
713of IO that will be done, it will only stop fio if this condition is met
714before other end-of-job criteria.
ddf24e42 715.TP
d60e92d1 716.BI fsync \fR=\fPint
d1429b5c
AC
717How many I/Os to perform before issuing an \fBfsync\fR\|(2) of dirty data. If
7180, don't sync. Default: 0.
d60e92d1 719.TP
5f9099ea
JA
720.BI fdatasync \fR=\fPint
721Like \fBfsync\fR, but uses \fBfdatasync\fR\|(2) instead to only sync the
722data parts of the file. Default: 0.
723.TP
fa769d44
SW
724.BI write_barrier \fR=\fPint
725Make every Nth write a barrier write.
726.TP
e76b1da4 727.BI sync_file_range \fR=\fPstr:int
ccc2b328
SW
728Use \fBsync_file_range\fR\|(2) for every \fRval\fP number of write operations. Fio will
729track range of writes that have happened since the last \fBsync_file_range\fR\|(2) call.
e76b1da4
JA
730\fRstr\fP can currently be one or more of:
731.RS
732.TP
733.B wait_before
734SYNC_FILE_RANGE_WAIT_BEFORE
735.TP
736.B write
737SYNC_FILE_RANGE_WRITE
738.TP
739.B wait_after
740SYNC_FILE_RANGE_WRITE
741.TP
742.RE
743.P
744So if you do sync_file_range=wait_before,write:8, fio would use
745\fBSYNC_FILE_RANGE_WAIT_BEFORE | SYNC_FILE_RANGE_WRITE\fP for every 8 writes.
ccc2b328 746Also see the \fBsync_file_range\fR\|(2) man page. This option is Linux specific.
e76b1da4 747.TP
d60e92d1 748.BI overwrite \fR=\fPbool
d1429b5c 749If writing, setup the file first and do overwrites. Default: false.
d60e92d1
AC
750.TP
751.BI end_fsync \fR=\fPbool
dbd11ead 752Sync file contents when a write stage has completed. Default: false.
d60e92d1
AC
753.TP
754.BI fsync_on_close \fR=\fPbool
755If true, sync file contents on close. This differs from \fBend_fsync\fR in that
d1429b5c 756it will happen on every close, not just at the end of the job. Default: false.
d60e92d1 757.TP
d60e92d1
AC
758.BI rwmixread \fR=\fPint
759Percentage of a mixed workload that should be reads. Default: 50.
760.TP
761.BI rwmixwrite \fR=\fPint
d1429b5c 762Percentage of a mixed workload that should be writes. If \fBrwmixread\fR and
c35dd7a6
JA
763\fBrwmixwrite\fR are given and do not sum to 100%, the latter of the two
764overrides the first. This may interfere with a given rate setting, if fio is
765asked to limit reads or writes to a certain rate. If that is the case, then
766the distribution may be skewed. Default: 50.
d60e92d1 767.TP
92d42d69
JA
768.BI random_distribution \fR=\fPstr:float
769By default, fio will use a completely uniform random distribution when asked
770to perform random IO. Sometimes it is useful to skew the distribution in
771specific ways, ensuring that some parts of the data is more hot than others.
772Fio includes the following distribution models:
773.RS
774.TP
775.B random
776Uniform random distribution
777.TP
778.B zipf
779Zipf distribution
780.TP
781.B pareto
782Pareto distribution
783.TP
784.RE
785.P
786When using a zipf or pareto distribution, an input value is also needed to
787define the access pattern. For zipf, this is the zipf theta. For pareto,
788it's the pareto power. Fio includes a test program, genzipf, that can be
789used visualize what the given input values will yield in terms of hit rates.
790If you wanted to use zipf with a theta of 1.2, you would use
791random_distribution=zipf:1.2 as the option. If a non-uniform model is used,
792fio will disable use of the random map.
793.TP
211c9b89
JA
794.BI percentage_random \fR=\fPint
795For a random workload, set how big a percentage should be random. This defaults
796to 100%, in which case the workload is fully random. It can be set from
797anywhere from 0 to 100. Setting it to 0 would make the workload fully
d9472271
JA
798sequential. It is possible to set different values for reads, writes, and
799trim. To do so, simply use a comma separated list. See \fBblocksize\fR.
211c9b89 800.TP
d60e92d1
AC
801.B norandommap
802Normally \fBfio\fR will cover every block of the file when doing random I/O. If
803this parameter is given, a new offset will be chosen without looking at past
804I/O history. This parameter is mutually exclusive with \fBverify\fR.
805.TP
744492c9 806.BI softrandommap \fR=\fPbool
3ce9dcaf
JA
807See \fBnorandommap\fR. If fio runs with the random block map enabled and it
808fails to allocate the map, if this option is set it will continue without a
809random block map. As coverage will not be as complete as with random maps, this
810option is disabled by default.
811.TP
e8b1961d
JA
812.BI random_generator \fR=\fPstr
813Fio supports the following engines for generating IO offsets for random IO:
814.RS
815.TP
816.B tausworthe
817Strong 2^88 cycle random number generator
818.TP
819.B lfsr
820Linear feedback shift register generator
821.TP
822.RE
823.P
824Tausworthe is a strong random number generator, but it requires tracking on the
825side if we want to ensure that blocks are only read or written once. LFSR
826guarantees that we never generate the same offset twice, and it's also less
827computationally expensive. It's not a true random generator, however, though
828for IO purposes it's typically good enough. LFSR only works with single block
829sizes, not with workloads that use multiple block sizes. If used with such a
830workload, fio may read or write some blocks multiple times.
831.TP
d60e92d1 832.BI nice \fR=\fPint
ccc2b328 833Run job with given nice value. See \fBnice\fR\|(2).
d60e92d1
AC
834.TP
835.BI prio \fR=\fPint
836Set I/O priority value of this job between 0 (highest) and 7 (lowest). See
ccc2b328 837\fBionice\fR\|(1).
d60e92d1
AC
838.TP
839.BI prioclass \fR=\fPint
ccc2b328 840Set I/O priority class. See \fBionice\fR\|(1).
d60e92d1
AC
841.TP
842.BI thinktime \fR=\fPint
843Stall job for given number of microseconds between issuing I/Os.
844.TP
845.BI thinktime_spin \fR=\fPint
846Pretend to spend CPU time for given number of microseconds, sleeping the rest
847of the time specified by \fBthinktime\fR. Only valid if \fBthinktime\fR is set.
848.TP
849.BI thinktime_blocks \fR=\fPint
4d01ece6
JA
850Only valid if thinktime is set - control how many blocks to issue, before
851waiting \fBthinktime\fR microseconds. If not set, defaults to 1 which will
852make fio wait \fBthinktime\fR microseconds after every block. This
853effectively makes any queue depth setting redundant, since no more than 1 IO
854will be queued before we have to complete it and do our thinktime. In other
855words, this setting effectively caps the queue depth if the latter is larger.
d60e92d1
AC
856Default: 1.
857.TP
858.BI rate \fR=\fPint
c35dd7a6
JA
859Cap bandwidth used by this job. The number is in bytes/sec, the normal postfix
860rules apply. You can use \fBrate\fR=500k to limit reads and writes to 500k each,
861or you can specify read and writes separately. Using \fBrate\fR=1m,500k would
862limit reads to 1MB/sec and writes to 500KB/sec. Capping only reads or writes
863can be done with \fBrate\fR=,500k or \fBrate\fR=500k,. The former will only
864limit writes (to 500KB/sec), the latter will only limit reads.
d60e92d1
AC
865.TP
866.BI ratemin \fR=\fPint
867Tell \fBfio\fR to do whatever it can to maintain at least the given bandwidth.
c35dd7a6
JA
868Failing to meet this requirement will cause the job to exit. The same format
869as \fBrate\fR is used for read vs write separation.
d60e92d1
AC
870.TP
871.BI rate_iops \fR=\fPint
c35dd7a6
JA
872Cap the bandwidth to this number of IOPS. Basically the same as rate, just
873specified independently of bandwidth. The same format as \fBrate\fR is used for
de8f6de9 874read vs write separation. If \fBblocksize\fR is a range, the smallest block
c35dd7a6 875size is used as the metric.
d60e92d1
AC
876.TP
877.BI rate_iops_min \fR=\fPint
c35dd7a6 878If this rate of I/O is not met, the job will exit. The same format as \fBrate\fR
de8f6de9 879is used for read vs write separation.
d60e92d1
AC
880.TP
881.BI ratecycle \fR=\fPint
882Average bandwidth for \fBrate\fR and \fBratemin\fR over this number of
883milliseconds. Default: 1000ms.
884.TP
3e260a46
JA
885.BI latency_target \fR=\fPint
886If set, fio will attempt to find the max performance point that the given
887workload will run at while maintaining a latency below this target. The
888values is given in microseconds. See \fBlatency_window\fR and
889\fBlatency_percentile\fR.
890.TP
891.BI latency_window \fR=\fPint
892Used with \fBlatency_target\fR to specify the sample window that the job
893is run at varying queue depths to test the performance. The value is given
894in microseconds.
895.TP
896.BI latency_percentile \fR=\fPfloat
897The percentage of IOs that must fall within the criteria specified by
898\fBlatency_target\fR and \fBlatency_window\fR. If not set, this defaults
899to 100.0, meaning that all IOs must be equal or below to the value set
900by \fBlatency_target\fR.
901.TP
15501535
JA
902.BI max_latency \fR=\fPint
903If set, fio will exit the job if it exceeds this maximum latency. It will exit
904with an ETIME error.
905.TP
d60e92d1
AC
906.BI cpumask \fR=\fPint
907Set CPU affinity for this job. \fIint\fR is a bitmask of allowed CPUs the job
908may run on. See \fBsched_setaffinity\fR\|(2).
909.TP
910.BI cpus_allowed \fR=\fPstr
911Same as \fBcpumask\fR, but allows a comma-delimited list of CPU numbers.
912.TP
c2acfbac
JA
913.BI cpus_allowed_policy \fR=\fPstr
914Set the policy of how fio distributes the CPUs specified by \fBcpus_allowed\fR
915or \fBcpumask\fR. Two policies are supported:
916.RS
917.RS
918.TP
919.B shared
920All jobs will share the CPU set specified.
921.TP
922.B split
923Each job will get a unique CPU from the CPU set.
924.RE
925.P
926\fBshared\fR is the default behaviour, if the option isn't specified. If
ada083cd
JA
927\fBsplit\fR is specified, then fio will assign one cpu per job. If not enough
928CPUs are given for the jobs listed, then fio will roundrobin the CPUs in
929the set.
c2acfbac
JA
930.RE
931.P
932.TP
d0b937ed 933.BI numa_cpu_nodes \fR=\fPstr
cecbfd47 934Set this job running on specified NUMA nodes' CPUs. The arguments allow
d0b937ed
YR
935comma delimited list of cpu numbers, A-B ranges, or 'all'.
936.TP
937.BI numa_mem_policy \fR=\fPstr
938Set this job's memory policy and corresponding NUMA nodes. Format of
cecbfd47 939the arguments:
d0b937ed
YR
940.RS
941.TP
942.B <mode>[:<nodelist>]
943.TP
944.B mode
945is one of the following memory policy:
946.TP
947.B default, prefer, bind, interleave, local
948.TP
949.RE
950For \fBdefault\fR and \fBlocal\fR memory policy, no \fBnodelist\fR is
951needed to be specified. For \fBprefer\fR, only one node is
952allowed. For \fBbind\fR and \fBinterleave\fR, \fBnodelist\fR allows
953comma delimited list of numbers, A-B ranges, or 'all'.
954.TP
23ed19b0
CE
955.BI startdelay \fR=\fPirange
956Delay start of job for the specified number of seconds. Supports all time
957suffixes to allow specification of hours, minutes, seconds and
958milliseconds - seconds are the default if a unit is ommited.
959Can be given as a range which causes each thread to choose randomly out of the
960range.
d60e92d1
AC
961.TP
962.BI runtime \fR=\fPint
963Terminate processing after the specified number of seconds.
964.TP
965.B time_based
966If given, run for the specified \fBruntime\fR duration even if the files are
967completely read or written. The same workload will be repeated as many times
968as \fBruntime\fR allows.
969.TP
901bb994
JA
970.BI ramp_time \fR=\fPint
971If set, fio will run the specified workload for this amount of time before
972logging any performance numbers. Useful for letting performance settle before
973logging results, thus minimizing the runtime required for stable results. Note
c35dd7a6
JA
974that the \fBramp_time\fR is considered lead in time for a job, thus it will
975increase the total runtime if a special timeout or runtime is specified.
901bb994 976.TP
d60e92d1
AC
977.BI invalidate \fR=\fPbool
978Invalidate buffer-cache for the file prior to starting I/O. Default: true.
979.TP
980.BI sync \fR=\fPbool
981Use synchronous I/O for buffered writes. For the majority of I/O engines,
d1429b5c 982this means using O_SYNC. Default: false.
d60e92d1
AC
983.TP
984.BI iomem \fR=\fPstr "\fR,\fP mem" \fR=\fPstr
985Allocation method for I/O unit buffer. Allowed values are:
986.RS
987.RS
988.TP
989.B malloc
ccc2b328 990Allocate memory with \fBmalloc\fR\|(3).
d60e92d1
AC
991.TP
992.B shm
ccc2b328 993Use shared memory buffers allocated through \fBshmget\fR\|(2).
d60e92d1
AC
994.TP
995.B shmhuge
996Same as \fBshm\fR, but use huge pages as backing.
997.TP
998.B mmap
ccc2b328 999Use \fBmmap\fR\|(2) for allocation. Uses anonymous memory unless a filename
d60e92d1
AC
1000is given after the option in the format `:\fIfile\fR'.
1001.TP
1002.B mmaphuge
1003Same as \fBmmap\fR, but use huge files as backing.
1004.RE
1005.P
1006The amount of memory allocated is the maximum allowed \fBblocksize\fR for the
1007job multiplied by \fBiodepth\fR. For \fBshmhuge\fR or \fBmmaphuge\fR to work,
1008the system must have free huge pages allocated. \fBmmaphuge\fR also needs to
2e266ba6
JA
1009have hugetlbfs mounted, and \fIfile\fR must point there. At least on Linux,
1010huge pages must be manually allocated. See \fB/proc/sys/vm/nr_hugehages\fR
1011and the documentation for that. Normally you just need to echo an appropriate
1012number, eg echoing 8 will ensure that the OS has 8 huge pages ready for
1013use.
d60e92d1
AC
1014.RE
1015.TP
d392365e 1016.BI iomem_align \fR=\fPint "\fR,\fP mem_align" \fR=\fPint
cecbfd47 1017This indicates the memory alignment of the IO memory buffers. Note that the
d529ee19
JA
1018given alignment is applied to the first IO unit buffer, if using \fBiodepth\fR
1019the alignment of the following buffers are given by the \fBbs\fR used. In
1020other words, if using a \fBbs\fR that is a multiple of the page sized in the
1021system, all buffers will be aligned to this value. If using a \fBbs\fR that
1022is not page aligned, the alignment of subsequent IO memory buffers is the
1023sum of the \fBiomem_align\fR and \fBbs\fR used.
1024.TP
f7fa2653 1025.BI hugepage\-size \fR=\fPint
d60e92d1 1026Defines the size of a huge page. Must be at least equal to the system setting.
b22989b9 1027Should be a multiple of 1MB. Default: 4MB.
d60e92d1
AC
1028.TP
1029.B exitall
1030Terminate all jobs when one finishes. Default: wait for each job to finish.
1031.TP
1032.BI bwavgtime \fR=\fPint
1033Average bandwidth calculations over the given time in milliseconds. Default:
1034500ms.
1035.TP
c8eeb9df
JA
1036.BI iopsavgtime \fR=\fPint
1037Average IOPS calculations over the given time in milliseconds. Default:
1038500ms.
1039.TP
d60e92d1 1040.BI create_serialize \fR=\fPbool
d1429b5c 1041If true, serialize file creation for the jobs. Default: true.
d60e92d1
AC
1042.TP
1043.BI create_fsync \fR=\fPbool
ccc2b328 1044\fBfsync\fR\|(2) data file after creation. Default: true.
d60e92d1 1045.TP
6b7f6851
JA
1046.BI create_on_open \fR=\fPbool
1047If true, the files are not created until they are opened for IO by the job.
1048.TP
25460cf6
JA
1049.BI create_only \fR=\fPbool
1050If true, fio will only run the setup phase of the job. If files need to be
1051laid out or updated on disk, only that will be done. The actual job contents
1052are not executed.
1053.TP
e9f48479
JA
1054.BI pre_read \fR=\fPbool
1055If this is given, files will be pre-read into memory before starting the given
1056IO operation. This will also clear the \fR \fBinvalidate\fR flag, since it is
9c0d2241
JA
1057pointless to pre-read and then drop the cache. This will only work for IO
1058engines that are seekable, since they allow you to read the same data
1059multiple times. Thus it will not work on eg network or splice IO.
e9f48479 1060.TP
d60e92d1
AC
1061.BI unlink \fR=\fPbool
1062Unlink job files when done. Default: false.
1063.TP
1064.BI loops \fR=\fPint
1065Specifies the number of iterations (runs of the same workload) of this job.
1066Default: 1.
1067.TP
5e4c7118
JA
1068.BI verify_only \fR=\fPbool
1069Do not perform the specified workload, only verify data still matches previous
1070invocation of this workload. This option allows one to check data multiple
1071times at a later date without overwriting it. This option makes sense only for
1072workloads that write data, and does not support workloads with the
1073\fBtime_based\fR option set.
1074.TP
d60e92d1
AC
1075.BI do_verify \fR=\fPbool
1076Run the verify phase after a write phase. Only valid if \fBverify\fR is set.
1077Default: true.
1078.TP
1079.BI verify \fR=\fPstr
1080Method of verifying file contents after each iteration of the job. Allowed
1081values are:
1082.RS
1083.RS
1084.TP
844ea602 1085.B md5 crc16 crc32 crc32c crc32c-intel crc64 crc7 sha256 sha512 sha1 xxhash
0539d758
JA
1086Store appropriate checksum in the header of each block. crc32c-intel is
1087hardware accelerated SSE4.2 driven, falls back to regular crc32c if
1088not supported by the system.
d60e92d1
AC
1089.TP
1090.B meta
1091Write extra information about each I/O (timestamp, block number, etc.). The
996093bb 1092block number is verified. See \fBverify_pattern\fR as well.
d60e92d1
AC
1093.TP
1094.B null
1095Pretend to verify. Used for testing internals.
1096.RE
b892dc08
JA
1097
1098This option can be used for repeated burn-in tests of a system to make sure
1099that the written data is also correctly read back. If the data direction given
1100is a read or random read, fio will assume that it should verify a previously
1101written file. If the data direction includes any form of write, the verify will
1102be of the newly written data.
d60e92d1
AC
1103.RE
1104.TP
5c9323fb 1105.BI verifysort \fR=\fPbool
d60e92d1
AC
1106If true, written verify blocks are sorted if \fBfio\fR deems it to be faster to
1107read them back in a sorted manner. Default: true.
1108.TP
fa769d44
SW
1109.BI verifysort_nr \fR=\fPint
1110Pre-load and sort verify blocks for a read workload.
1111.TP
f7fa2653 1112.BI verify_offset \fR=\fPint
d60e92d1 1113Swap the verification header with data somewhere else in the block before
d1429b5c 1114writing. It is swapped back before verifying.
d60e92d1 1115.TP
f7fa2653 1116.BI verify_interval \fR=\fPint
d60e92d1
AC
1117Write the verification header for this number of bytes, which should divide
1118\fBblocksize\fR. Default: \fBblocksize\fR.
1119.TP
996093bb
JA
1120.BI verify_pattern \fR=\fPstr
1121If set, fio will fill the io buffers with this pattern. Fio defaults to filling
1122with totally random bytes, but sometimes it's interesting to fill with a known
1123pattern for io verification purposes. Depending on the width of the pattern,
1124fio will fill 1/2/3/4 bytes of the buffer at the time(it can be either a
1125decimal or a hex number). The verify_pattern if larger than a 32-bit quantity
1126has to be a hex number that starts with either "0x" or "0X". Use with
1127\fBverify\fP=meta.
1128.TP
d60e92d1
AC
1129.BI verify_fatal \fR=\fPbool
1130If true, exit the job on the first observed verification failure. Default:
1131false.
1132.TP
b463e936
JA
1133.BI verify_dump \fR=\fPbool
1134If set, dump the contents of both the original data block and the data block we
1135read off disk to files. This allows later analysis to inspect just what kind of
ef71e317 1136data corruption occurred. Off by default.
b463e936 1137.TP
e8462bd8
JA
1138.BI verify_async \fR=\fPint
1139Fio will normally verify IO inline from the submitting thread. This option
1140takes an integer describing how many async offload threads to create for IO
1141verification instead, causing fio to offload the duty of verifying IO contents
c85c324c
JA
1142to one or more separate threads. If using this offload option, even sync IO
1143engines can benefit from using an \fBiodepth\fR setting higher than 1, as it
1144allows them to have IO in flight while verifies are running.
e8462bd8
JA
1145.TP
1146.BI verify_async_cpus \fR=\fPstr
1147Tell fio to set the given CPU affinity on the async IO verification threads.
1148See \fBcpus_allowed\fP for the format used.
1149.TP
6f87418f
JA
1150.BI verify_backlog \fR=\fPint
1151Fio will normally verify the written contents of a job that utilizes verify
1152once that job has completed. In other words, everything is written then
1153everything is read back and verified. You may want to verify continually
1154instead for a variety of reasons. Fio stores the meta data associated with an
1155IO block in memory, so for large verify workloads, quite a bit of memory would
092f707f
DN
1156be used up holding this meta data. If this option is enabled, fio will write
1157only N blocks before verifying these blocks.
6f87418f
JA
1158.TP
1159.BI verify_backlog_batch \fR=\fPint
1160Control how many blocks fio will verify if verify_backlog is set. If not set,
1161will default to the value of \fBverify_backlog\fR (meaning the entire queue is
092f707f
DN
1162read back and verified). If \fBverify_backlog_batch\fR is less than
1163\fBverify_backlog\fR then not all blocks will be verified, if
1164\fBverify_backlog_batch\fR is larger than \fBverify_backlog\fR, some blocks
1165will be verified more than once.
6f87418f 1166.TP
fa769d44
SW
1167.BI trim_percentage \fR=\fPint
1168Number of verify blocks to discard/trim.
1169.TP
1170.BI trim_verify_zero \fR=\fPbool
1171Verify that trim/discarded blocks are returned as zeroes.
1172.TP
1173.BI trim_backlog \fR=\fPint
1174Trim after this number of blocks are written.
1175.TP
1176.BI trim_backlog_batch \fR=\fPint
1177Trim this number of IO blocks.
1178.TP
1179.BI experimental_verify \fR=\fPbool
1180Enable experimental verification.
1181.TP
d392365e 1182.B stonewall "\fR,\fP wait_for_previous"
5982a925 1183Wait for preceding jobs in the job file to exit before starting this one.
d60e92d1
AC
1184\fBstonewall\fR implies \fBnew_group\fR.
1185.TP
1186.B new_group
1187Start a new reporting group. If not given, all jobs in a file will be part
1188of the same reporting group, unless separated by a stonewall.
1189.TP
1190.BI numjobs \fR=\fPint
1191Number of clones (processes/threads performing the same workload) of this job.
1192Default: 1.
1193.TP
1194.B group_reporting
1195If set, display per-group reports instead of per-job when \fBnumjobs\fR is
1196specified.
1197.TP
1198.B thread
1199Use threads created with \fBpthread_create\fR\|(3) instead of processes created
1200with \fBfork\fR\|(2).
1201.TP
f7fa2653 1202.BI zonesize \fR=\fPint
d60e92d1
AC
1203Divide file into zones of the specified size in bytes. See \fBzoneskip\fR.
1204.TP
fa769d44
SW
1205.BI zonerange \fR=\fPint
1206Give size of an IO zone. See \fBzoneskip\fR.
1207.TP
f7fa2653 1208.BI zoneskip \fR=\fPint
d1429b5c 1209Skip the specified number of bytes when \fBzonesize\fR bytes of data have been
d60e92d1
AC
1210read.
1211.TP
1212.BI write_iolog \fR=\fPstr
5b42a488
SH
1213Write the issued I/O patterns to the specified file. Specify a separate file
1214for each job, otherwise the iologs will be interspersed and the file may be
1215corrupt.
d60e92d1
AC
1216.TP
1217.BI read_iolog \fR=\fPstr
1218Replay the I/O patterns contained in the specified file generated by
1219\fBwrite_iolog\fR, or may be a \fBblktrace\fR binary file.
1220.TP
64bbb865
DN
1221.BI replay_no_stall \fR=\fPint
1222While replaying I/O patterns using \fBread_iolog\fR the default behavior
1223attempts to respect timing information between I/Os. Enabling
1224\fBreplay_no_stall\fR causes I/Os to be replayed as fast as possible while
1225still respecting ordering.
1226.TP
d1c46c04
DN
1227.BI replay_redirect \fR=\fPstr
1228While replaying I/O patterns using \fBread_iolog\fR the default behavior
1229is to replay the IOPS onto the major/minor device that each IOP was recorded
1230from. Setting \fBreplay_redirect\fR causes all IOPS to be replayed onto the
1231single specified device regardless of the device it was recorded from.
1232.TP
836bad52 1233.BI write_bw_log \fR=\fPstr
901bb994
JA
1234If given, write a bandwidth log of the jobs in this job file. Can be used to
1235store data of the bandwidth of the jobs in their lifetime. The included
1236fio_generate_plots script uses gnuplot to turn these text files into nice
26b26fca 1237graphs. See \fBwrite_lat_log\fR for behaviour of given filename. For this
8ad3b3dd
JA
1238option, the postfix is _bw.x.log, where x is the index of the job (1..N,
1239where N is the number of jobs)
d60e92d1 1240.TP
836bad52 1241.BI write_lat_log \fR=\fPstr
901bb994 1242Same as \fBwrite_bw_log\fR, but writes I/O completion latencies. If no
8ad3b3dd
JA
1243filename is given with this option, the default filename of
1244"jobname_type.x.log" is used, where x is the index of the job (1..N, where
1245N is the number of jobs). Even if the filename is given, fio will still
1246append the type of log.
901bb994 1247.TP
c8eeb9df
JA
1248.BI write_iops_log \fR=\fPstr
1249Same as \fBwrite_bw_log\fR, but writes IOPS. If no filename is given with this
8ad3b3dd
JA
1250option, the default filename of "jobname_type.x.log" is used, where x is the
1251index of the job (1..N, where N is the number of jobs). Even if the filename
1252is given, fio will still append the type of log.
c8eeb9df 1253.TP
b8bc8cba
JA
1254.BI log_avg_msec \fR=\fPint
1255By default, fio will log an entry in the iops, latency, or bw log for every
1256IO that completes. When writing to the disk log, that can quickly grow to a
1257very large size. Setting this option makes fio average the each log entry
1258over the specified period of time, reducing the resolution of the log.
1259Defaults to 0.
1260.TP
ae588852
JA
1261.BI log_offset \fR=\fPbool
1262If this is set, the iolog options will include the byte offset for the IO
1263entry as well as the other data values.
1264.TP
aee2ab67
JA
1265.BI log_compression \fR=\fPint
1266If this is set, fio will compress the IO logs as it goes, to keep the memory
1267footprint lower. When a log reaches the specified size, that chunk is removed
1268and compressed in the background. Given that IO logs are fairly highly
1269compressible, this yields a nice memory savings for longer runs. The downside
1270is that the compression will consume some background CPU cycles, so it may
1271impact the run. This, however, is also true if the logging ends up consuming
1272most of the system memory. So pick your poison. The IO logs are saved
1273normally at the end of a run, by decompressing the chunks and storing them
1274in the specified log file. This feature depends on the availability of zlib.
1275.TP
b26317c9
JA
1276.BI log_store_compressed \fR=\fPbool
1277If set, and \fBlog\fR_compression is also set, fio will store the log files in
1278a compressed format. They can be decompressed with fio, using the
1279\fB\-\-inflate-log\fR command line parameter. The files will be stored with a
1280\fB\.fz\fR suffix.
1281.TP
836bad52 1282.BI disable_lat \fR=\fPbool
02af0988 1283Disable measurements of total latency numbers. Useful only for cutting
ccc2b328 1284back the number of calls to \fBgettimeofday\fR\|(2), as that does impact performance at
901bb994
JA
1285really high IOPS rates. Note that to really get rid of a large amount of these
1286calls, this option must be used with disable_slat and disable_bw as well.
1287.TP
836bad52 1288.BI disable_clat \fR=\fPbool
c95f9daf 1289Disable measurements of completion latency numbers. See \fBdisable_lat\fR.
02af0988 1290.TP
836bad52 1291.BI disable_slat \fR=\fPbool
02af0988 1292Disable measurements of submission latency numbers. See \fBdisable_lat\fR.
901bb994 1293.TP
836bad52 1294.BI disable_bw_measurement \fR=\fPbool
02af0988 1295Disable measurements of throughput/bandwidth numbers. See \fBdisable_lat\fR.
d60e92d1 1296.TP
f7fa2653 1297.BI lockmem \fR=\fPint
d60e92d1 1298Pin the specified amount of memory with \fBmlock\fR\|(2). Can be used to
81c6b6cd 1299simulate a smaller amount of memory. The amount specified is per worker.
d60e92d1
AC
1300.TP
1301.BI exec_prerun \fR=\fPstr
1302Before running the job, execute the specified command with \fBsystem\fR\|(3).
ce486495
EV
1303.RS
1304Output is redirected in a file called \fBjobname.prerun.txt\fR
1305.RE
d60e92d1
AC
1306.TP
1307.BI exec_postrun \fR=\fPstr
1308Same as \fBexec_prerun\fR, but the command is executed after the job completes.
ce486495
EV
1309.RS
1310Output is redirected in a file called \fBjobname.postrun.txt\fR
1311.RE
d60e92d1
AC
1312.TP
1313.BI ioscheduler \fR=\fPstr
1314Attempt to switch the device hosting the file to the specified I/O scheduler.
1315.TP
d60e92d1 1316.BI disk_util \fR=\fPbool
d1429b5c 1317Generate disk utilization statistics if the platform supports it. Default: true.
901bb994 1318.TP
23893646
JA
1319.BI clocksource \fR=\fPstr
1320Use the given clocksource as the base of timing. The supported options are:
1321.RS
1322.TP
1323.B gettimeofday
ccc2b328 1324\fBgettimeofday\fR\|(2)
23893646
JA
1325.TP
1326.B clock_gettime
ccc2b328 1327\fBclock_gettime\fR\|(2)
23893646
JA
1328.TP
1329.B cpu
1330Internal CPU clock source
1331.TP
1332.RE
1333.P
1334\fBcpu\fR is the preferred clocksource if it is reliable, as it is very fast
1335(and fio is heavy on time calls). Fio will automatically use this clocksource
1336if it's supported and considered reliable on the system it is running on,
1337unless another clocksource is specifically set. For x86/x86-64 CPUs, this
1338means supporting TSC Invariant.
1339.TP
901bb994 1340.BI gtod_reduce \fR=\fPbool
ccc2b328 1341Enable all of the \fBgettimeofday\fR\|(2) reducing options (disable_clat, disable_slat,
901bb994 1342disable_bw) plus reduce precision of the timeout somewhat to really shrink the
ccc2b328 1343\fBgettimeofday\fR\|(2) call count. With this option enabled, we only do about 0.4% of
901bb994
JA
1344the gtod() calls we would have done if all time keeping was enabled.
1345.TP
1346.BI gtod_cpu \fR=\fPint
1347Sometimes it's cheaper to dedicate a single thread of execution to just getting
1348the current time. Fio (and databases, for instance) are very intensive on
ccc2b328 1349\fBgettimeofday\fR\|(2) calls. With this option, you can set one CPU aside for doing
901bb994
JA
1350nothing but logging current time to a shared memory location. Then the other
1351threads/processes that run IO workloads need only copy that segment, instead of
ccc2b328 1352entering the kernel with a \fBgettimeofday\fR\|(2) call. The CPU set aside for doing
901bb994
JA
1353these time calls will be excluded from other uses. Fio will manually clear it
1354from the CPU mask of other jobs.
f2bba182 1355.TP
8b28bd41
DM
1356.BI ignore_error \fR=\fPstr
1357Sometimes you want to ignore some errors during test in that case you can specify
1358error list for each error type.
1359.br
1360ignore_error=READ_ERR_LIST,WRITE_ERR_LIST,VERIFY_ERR_LIST
1361.br
1362errors for given error type is separated with ':'.
1363Error may be symbol ('ENOSPC', 'ENOMEM') or an integer.
1364.br
1365Example: ignore_error=EAGAIN,ENOSPC:122 .
1366.br
1367This option will ignore EAGAIN from READ, and ENOSPC and 122(EDQUOT) from WRITE.
1368.TP
1369.BI error_dump \fR=\fPbool
1370If set dump every error even if it is non fatal, true by default. If disabled
1371only fatal error will be dumped
1372.TP
fa769d44
SW
1373.BI profile \fR=\fPstr
1374Select a specific builtin performance test.
1375.TP
a696fa2a
JA
1376.BI cgroup \fR=\fPstr
1377Add job to this control group. If it doesn't exist, it will be created.
6adb38a1
JA
1378The system must have a mounted cgroup blkio mount point for this to work. If
1379your system doesn't have it mounted, you can do so with:
1380
5982a925 1381# mount \-t cgroup \-o blkio none /cgroup
a696fa2a
JA
1382.TP
1383.BI cgroup_weight \fR=\fPint
1384Set the weight of the cgroup to this value. See the documentation that comes
1385with the kernel, allowed values are in the range of 100..1000.
e0b0d892 1386.TP
7de87099
VG
1387.BI cgroup_nodelete \fR=\fPbool
1388Normally fio will delete the cgroups it has created after the job completion.
1389To override this behavior and to leave cgroups around after the job completion,
1390set cgroup_nodelete=1. This can be useful if one wants to inspect various
1391cgroup files after job completion. Default: false
1392.TP
e0b0d892
JA
1393.BI uid \fR=\fPint
1394Instead of running as the invoking user, set the user ID to this value before
1395the thread/process does any work.
1396.TP
1397.BI gid \fR=\fPint
1398Set group ID, see \fBuid\fR.
83349190 1399.TP
fa769d44
SW
1400.BI unit_base \fR=\fPint
1401Base unit for reporting. Allowed values are:
1402.RS
1403.TP
1404.B 0
1405Use auto-detection (default).
1406.TP
1407.B 8
1408Byte based.
1409.TP
1410.B 1
1411Bit based.
1412.RE
1413.P
1414.TP
9e684a49
DE
1415.BI flow_id \fR=\fPint
1416The ID of the flow. If not specified, it defaults to being a global flow. See
1417\fBflow\fR.
1418.TP
1419.BI flow \fR=\fPint
1420Weight in token-based flow control. If this value is used, then there is a
1421\fBflow counter\fR which is used to regulate the proportion of activity between
1422two or more jobs. fio attempts to keep this flow counter near zero. The
1423\fBflow\fR parameter stands for how much should be added or subtracted to the
1424flow counter on each iteration of the main I/O loop. That is, if one job has
1425\fBflow=8\fR and another job has \fBflow=-1\fR, then there will be a roughly
14261:8 ratio in how much one runs vs the other.
1427.TP
1428.BI flow_watermark \fR=\fPint
1429The maximum value that the absolute value of the flow counter is allowed to
1430reach before the job must wait for a lower value of the counter.
1431.TP
1432.BI flow_sleep \fR=\fPint
1433The period of time, in microseconds, to wait after the flow watermark has been
1434exceeded before retrying operations
1435.TP
83349190
YH
1436.BI clat_percentiles \fR=\fPbool
1437Enable the reporting of percentiles of completion latencies.
1438.TP
1439.BI percentile_list \fR=\fPfloat_list
1440Overwrite the default list of percentiles for completion
1441latencies. Each number is a floating number in the range (0,100], and
1442the maximum length of the list is 20. Use ':' to separate the
3eb07285 1443numbers. For example, \-\-percentile_list=99.5:99.9 will cause fio to
83349190
YH
1444report the values of completion latency below which 99.5% and 99.9% of
1445the observed latencies fell, respectively.
de890a1e
SL
1446.SS "Ioengine Parameters List"
1447Some parameters are only valid when a specific ioengine is in use. These are
1448used identically to normal parameters, with the caveat that when used on the
cf145d90 1449command line, they must come after the ioengine.
de890a1e 1450.TP
e4585935
JA
1451.BI (cpu)cpuload \fR=\fPint
1452Attempt to use the specified percentage of CPU cycles.
1453.TP
1454.BI (cpu)cpuchunks \fR=\fPint
1455Split the load into cycles of the given time. In microseconds.
1456.TP
046395d7
JA
1457.BI (cpu)exit_on_io_done \fR=\fPbool
1458Detect when IO threads are done, then exit.
1459.TP
de890a1e
SL
1460.BI (libaio)userspace_reap
1461Normally, with the libaio engine in use, fio will use
1462the io_getevents system call to reap newly returned events.
1463With this flag turned on, the AIO ring will be read directly
1464from user-space to reap events. The reaping mode is only
1465enabled when polling for a minimum of 0 events (eg when
1466iodepth_batch_complete=0).
1467.TP
1468.BI (net,netsplice)hostname \fR=\fPstr
1469The host name or IP address to use for TCP or UDP based IO.
1470If the job is a TCP listener or UDP reader, the hostname is not
b511c9aa 1471used and must be omitted unless it is a valid UDP multicast address.
de890a1e
SL
1472.TP
1473.BI (net,netsplice)port \fR=\fPint
6315af9d
JA
1474The TCP or UDP port to bind to or connect to. If this is used with
1475\fBnumjobs\fR to spawn multiple instances of the same job type, then
1476this will be the starting port number since fio will use a range of ports.
de890a1e 1477.TP
b93b6a2e
SB
1478.BI (net,netsplice)interface \fR=\fPstr
1479The IP address of the network interface used to send or receive UDP multicast
1480packets.
1481.TP
d3a623de
SB
1482.BI (net,netsplice)ttl \fR=\fPint
1483Time-to-live value for outgoing UDP multicast packets. Default: 1
1484.TP
1d360ffb
JA
1485.BI (net,netsplice)nodelay \fR=\fPbool
1486Set TCP_NODELAY on TCP connections.
1487.TP
de890a1e
SL
1488.BI (net,netsplice)protocol \fR=\fPstr "\fR,\fP proto" \fR=\fPstr
1489The network protocol to use. Accepted values are:
1490.RS
1491.RS
1492.TP
1493.B tcp
1494Transmission control protocol
1495.TP
49ccb8c1
JA
1496.B tcpv6
1497Transmission control protocol V6
1498.TP
de890a1e 1499.B udp
f5cc3d0e 1500User datagram protocol
de890a1e 1501.TP
49ccb8c1
JA
1502.B udpv6
1503User datagram protocol V6
1504.TP
de890a1e
SL
1505.B unix
1506UNIX domain socket
1507.RE
1508.P
1509When the protocol is TCP or UDP, the port must also be given,
1510as well as the hostname if the job is a TCP listener or UDP
1511reader. For unix sockets, the normal filename option should be
1512used and the port is invalid.
1513.RE
1514.TP
1515.BI (net,netsplice)listen
1516For TCP network connections, tell fio to listen for incoming
1517connections rather than initiating an outgoing connection. The
1518hostname must be omitted if this option is used.
d54fce84 1519.TP
7aeb1e94 1520.BI (net, pingpong) \fR=\fPbool
cecbfd47 1521Normally a network writer will just continue writing data, and a network reader
cf145d90 1522will just consume packets. If pingpong=1 is set, a writer will send its normal
7aeb1e94
JA
1523payload to the reader, then wait for the reader to send the same payload back.
1524This allows fio to measure network latencies. The submission and completion
1525latencies then measure local time spent sending or receiving, and the
1526completion latency measures how long it took for the other end to receive and
b511c9aa
SB
1527send back. For UDP multicast traffic pingpong=1 should only be set for a single
1528reader when multiple readers are listening to the same address.
7aeb1e94 1529.TP
1008602c
JA
1530.BI (net, window_size) \fR=\fPint
1531Set the desired socket buffer size for the connection.
1532.TP
e5f34d95
JA
1533.BI (net, mss) \fR=\fPint
1534Set the TCP maximum segment size (TCP_MAXSEG).
1535.TP
d54fce84
DM
1536.BI (e4defrag,donorname) \fR=\fPstr
1537File will be used as a block donor (swap extents between files)
1538.TP
1539.BI (e4defrag,inplace) \fR=\fPint
1540Configure donor file block allocation strategy
1541.RS
1542.BI 0(default) :
1543Preallocate donor's file on init
1544.TP
1545.BI 1:
cecbfd47 1546allocate space immediately inside defragment event, and free right after event
d54fce84 1547.RE
0d978694
DAG
1548.TP
1549.BI (rbd)rbdname \fR=\fPstr
1550Specifies the name of the RBD.
1551.TP
1552.BI (rbd)pool \fR=\fPstr
1553Specifies the name of the Ceph pool containing the RBD.
1554.TP
1555.BI (rbd)clientname \fR=\fPstr
1556Specifies the username (without the 'client.' prefix) used to access the Ceph cluster.
d60e92d1 1557.SH OUTPUT
d1429b5c
AC
1558While running, \fBfio\fR will display the status of the created jobs. For
1559example:
d60e92d1 1560.RS
d1429b5c 1561.P
d60e92d1
AC
1562Threads: 1: [_r] [24.8% done] [ 13509/ 8334 kb/s] [eta 00h:01m:31s]
1563.RE
1564.P
d1429b5c
AC
1565The characters in the first set of brackets denote the current status of each
1566threads. The possible values are:
1567.P
1568.PD 0
d60e92d1
AC
1569.RS
1570.TP
1571.B P
1572Setup but not started.
1573.TP
1574.B C
1575Thread created.
1576.TP
1577.B I
1578Initialized, waiting.
1579.TP
1580.B R
1581Running, doing sequential reads.
1582.TP
1583.B r
1584Running, doing random reads.
1585.TP
1586.B W
1587Running, doing sequential writes.
1588.TP
1589.B w
1590Running, doing random writes.
1591.TP
1592.B M
1593Running, doing mixed sequential reads/writes.
1594.TP
1595.B m
1596Running, doing mixed random reads/writes.
1597.TP
1598.B F
1599Running, currently waiting for \fBfsync\fR\|(2).
1600.TP
1601.B V
1602Running, verifying written data.
1603.TP
1604.B E
1605Exited, not reaped by main thread.
1606.TP
1607.B \-
1608Exited, thread reaped.
1609.RE
d1429b5c 1610.PD
d60e92d1
AC
1611.P
1612The second set of brackets shows the estimated completion percentage of
1613the current group. The third set shows the read and write I/O rate,
1614respectively. Finally, the estimated run time of the job is displayed.
1615.P
1616When \fBfio\fR completes (or is interrupted by Ctrl-C), it will show data
1617for each thread, each group of threads, and each disk, in that order.
1618.P
1619Per-thread statistics first show the threads client number, group-id, and
1620error code. The remaining figures are as follows:
1621.RS
d60e92d1
AC
1622.TP
1623.B io
1624Number of megabytes of I/O performed.
1625.TP
1626.B bw
1627Average data rate (bandwidth).
1628.TP
1629.B runt
1630Threads run time.
1631.TP
1632.B slat
1633Submission latency minimum, maximum, average and standard deviation. This is
1634the time it took to submit the I/O.
1635.TP
1636.B clat
1637Completion latency minimum, maximum, average and standard deviation. This
1638is the time between submission and completion.
1639.TP
1640.B bw
1641Bandwidth minimum, maximum, percentage of aggregate bandwidth received, average
1642and standard deviation.
1643.TP
1644.B cpu
1645CPU usage statistics. Includes user and system time, number of context switches
1646this thread went through and number of major and minor page faults.
1647.TP
1648.B IO depths
1649Distribution of I/O depths. Each depth includes everything less than (or equal)
1650to it, but greater than the previous depth.
1651.TP
1652.B IO issued
1653Number of read/write requests issued, and number of short read/write requests.
1654.TP
1655.B IO latencies
1656Distribution of I/O completion latencies. The numbers follow the same pattern
1657as \fBIO depths\fR.
1658.RE
d60e92d1
AC
1659.P
1660The group statistics show:
d1429b5c 1661.PD 0
d60e92d1
AC
1662.RS
1663.TP
1664.B io
1665Number of megabytes I/O performed.
1666.TP
1667.B aggrb
1668Aggregate bandwidth of threads in the group.
1669.TP
1670.B minb
1671Minimum average bandwidth a thread saw.
1672.TP
1673.B maxb
1674Maximum average bandwidth a thread saw.
1675.TP
1676.B mint
d1429b5c 1677Shortest runtime of threads in the group.
d60e92d1
AC
1678.TP
1679.B maxt
1680Longest runtime of threads in the group.
1681.RE
d1429b5c 1682.PD
d60e92d1
AC
1683.P
1684Finally, disk statistics are printed with reads first:
d1429b5c 1685.PD 0
d60e92d1
AC
1686.RS
1687.TP
1688.B ios
1689Number of I/Os performed by all groups.
1690.TP
1691.B merge
1692Number of merges in the I/O scheduler.
1693.TP
1694.B ticks
1695Number of ticks we kept the disk busy.
1696.TP
1697.B io_queue
1698Total time spent in the disk queue.
1699.TP
1700.B util
1701Disk utilization.
1702.RE
d1429b5c 1703.PD
8423bd11
JA
1704.P
1705It is also possible to get fio to dump the current output while it is
1706running, without terminating the job. To do that, send fio the \fBUSR1\fR
1707signal.
d60e92d1 1708.SH TERSE OUTPUT
2b8c71b0
CE
1709If the \fB\-\-minimal\fR / \fB\-\-append-terse\fR options are given, the
1710results will be printed/appended in a semicolon-delimited format suitable for
1711scripted use.
1712A job description (if provided) follows on a new line. Note that the first
525c2bfa
JA
1713number in the line is the version number. If the output has to be changed
1714for some reason, this number will be incremented by 1 to signify that
1715change. The fields are:
d60e92d1
AC
1716.P
1717.RS
5e726d0a 1718.B terse version, fio version, jobname, groupid, error
d60e92d1
AC
1719.P
1720Read status:
1721.RS
312b4af2 1722.B Total I/O \fR(KB)\fP, bandwidth \fR(KB/s)\fP, IOPS, runtime \fR(ms)\fP
d60e92d1
AC
1723.P
1724Submission latency:
1725.RS
1726.B min, max, mean, standard deviation
1727.RE
1728Completion latency:
1729.RS
1730.B min, max, mean, standard deviation
1731.RE
1db92cb6
JA
1732Completion latency percentiles (20 fields):
1733.RS
1734.B Xth percentile=usec
1735.RE
525c2bfa
JA
1736Total latency:
1737.RS
1738.B min, max, mean, standard deviation
1739.RE
d60e92d1
AC
1740Bandwidth:
1741.RS
1742.B min, max, aggregate percentage of total, mean, standard deviation
1743.RE
1744.RE
1745.P
1746Write status:
1747.RS
312b4af2 1748.B Total I/O \fR(KB)\fP, bandwidth \fR(KB/s)\fP, IOPS, runtime \fR(ms)\fP
d60e92d1
AC
1749.P
1750Submission latency:
1751.RS
1752.B min, max, mean, standard deviation
1753.RE
1754Completion latency:
1755.RS
1756.B min, max, mean, standard deviation
1757.RE
1db92cb6
JA
1758Completion latency percentiles (20 fields):
1759.RS
1760.B Xth percentile=usec
1761.RE
525c2bfa
JA
1762Total latency:
1763.RS
1764.B min, max, mean, standard deviation
1765.RE
d60e92d1
AC
1766Bandwidth:
1767.RS
1768.B min, max, aggregate percentage of total, mean, standard deviation
1769.RE
1770.RE
1771.P
d1429b5c 1772CPU usage:
d60e92d1 1773.RS
bd2626f0 1774.B user, system, context switches, major page faults, minor page faults
d60e92d1
AC
1775.RE
1776.P
1777IO depth distribution:
1778.RS
1779.B <=1, 2, 4, 8, 16, 32, >=64
1780.RE
1781.P
562c2d2f 1782IO latency distribution:
d60e92d1 1783.RS
562c2d2f
DN
1784Microseconds:
1785.RS
1786.B <=2, 4, 10, 20, 50, 100, 250, 500, 750, 1000
1787.RE
1788Milliseconds:
1789.RS
1790.B <=2, 4, 10, 20, 50, 100, 250, 500, 750, 1000, 2000, >=2000
1791.RE
1792.RE
1793.P
f2f788dd
JA
1794Disk utilization (1 for each disk used):
1795.RS
1796.B name, read ios, write ios, read merges, write merges, read ticks, write ticks, read in-queue time, write in-queue time, disk utilization percentage
1797.RE
1798.P
5982a925 1799Error Info (dependent on continue_on_error, default off):
562c2d2f
DN
1800.RS
1801.B total # errors, first error code
d60e92d1
AC
1802.RE
1803.P
562c2d2f 1804.B text description (if provided in config - appears on newline)
d60e92d1 1805.RE
49da1240
JA
1806.SH CLIENT / SERVER
1807Normally you would run fio as a stand-alone application on the machine
1808where the IO workload should be generated. However, it is also possible to
1809run the frontend and backend of fio separately. This makes it possible to
1810have a fio server running on the machine(s) where the IO workload should
1811be running, while controlling it from another machine.
1812
1813To start the server, you would do:
1814
1815\fBfio \-\-server=args\fR
1816
1817on that machine, where args defines what fio listens to. The arguments
811826be 1818are of the form 'type:hostname or IP:port'. 'type' is either 'ip' (or ip4)
20c67f10
MS
1819for TCP/IP v4, 'ip6' for TCP/IP v6, or 'sock' for a local unix domain
1820socket. 'hostname' is either a hostname or IP address, and 'port' is the port to
811826be 1821listen to (only valid for TCP/IP, not a local socket). Some examples:
49da1240 1822
e01e9745 18231) fio \-\-server
49da1240
JA
1824
1825 Start a fio server, listening on all interfaces on the default port (8765).
1826
e01e9745 18272) fio \-\-server=ip:hostname,4444
49da1240
JA
1828
1829 Start a fio server, listening on IP belonging to hostname and on port 4444.
1830
e01e9745 18313) fio \-\-server=ip6:::1,4444
811826be
JA
1832
1833 Start a fio server, listening on IPv6 localhost ::1 and on port 4444.
1834
e01e9745 18354) fio \-\-server=,4444
49da1240
JA
1836
1837 Start a fio server, listening on all interfaces on port 4444.
1838
e01e9745 18395) fio \-\-server=1.2.3.4
49da1240
JA
1840
1841 Start a fio server, listening on IP 1.2.3.4 on the default port.
1842
e01e9745 18436) fio \-\-server=sock:/tmp/fio.sock
49da1240
JA
1844
1845 Start a fio server, listening on the local socket /tmp/fio.sock.
1846
1847When a server is running, you can connect to it from a client. The client
1848is run with:
1849
e01e9745 1850fio \-\-local-args \-\-client=server \-\-remote-args <job file(s)>
49da1240 1851
e01e9745
MS
1852where \-\-local-args are arguments that are local to the client where it is
1853running, 'server' is the connect string, and \-\-remote-args and <job file(s)>
49da1240
JA
1854are sent to the server. The 'server' string follows the same format as it
1855does on the server side, to allow IP/hostname/socket and port strings.
1856You can connect to multiple clients as well, to do that you could run:
1857
e01e9745 1858fio \-\-client=server2 \-\-client=server2 <job file(s)>
323255cc
JA
1859
1860If the job file is located on the fio server, then you can tell the server
1861to load a local file as well. This is done by using \-\-remote-config:
1862
1863fio \-\-client=server \-\-remote-config /path/to/file.fio
1864
1865Then the fio serer will open this local (to the server) job file instead
1866of being passed one from the client.
d60e92d1 1867.SH AUTHORS
49da1240 1868
d60e92d1 1869.B fio
aa58d252
JA
1870was written by Jens Axboe <jens.axboe@oracle.com>,
1871now Jens Axboe <jaxboe@fusionio.com>.
d1429b5c
AC
1872.br
1873This man page was written by Aaron Carroll <aaronc@cse.unsw.edu.au> based
d60e92d1
AC
1874on documentation by Jens Axboe.
1875.SH "REPORTING BUGS"
482900c9 1876Report bugs to the \fBfio\fR mailing list <fio@vger.kernel.org>.
d1429b5c 1877See \fBREADME\fR.
d60e92d1 1878.SH "SEE ALSO"
d1429b5c
AC
1879For further documentation see \fBHOWTO\fR and \fBREADME\fR.
1880.br
1881Sample jobfiles are available in the \fBexamples\fR directory.
d60e92d1 1882