setgid first, setuid second
[fio.git] / README
CommitLineData
ebac4655
JA
1fio
2---
3
79809113
JA
4fio is a tool that will spawn a number of threads or processes doing a
5particular type of io action as specified by the user. fio takes a
6number of global parameters, each inherited by the thread unless
7otherwise parameters given to them overriding that setting is given.
8The typical use of fio is to write a job file matching the io load
9one wants to simulate.
ebac4655 10
2b02b546
JA
11
12Source
13------
14
15fio resides in a git repo, the canonical place is:
16
6b3eccb1 17git://git.kernel.dk/fio.git
97f049c9
JA
18
19The http protocol also works, path is the same.
2b02b546 20
79809113
JA
21Snapshots are frequently generated and they include the git meta data as
22well. You can download them here:
2b02b546
JA
23
24http://brick.kernel.dk/snaps/
25
1053a106 26
d85b1add
SK
27Binary packages
28---------------
29
30Debian:
31Starting with Debian "Squeeze", fio packages are part of the official
32Debian repository. http://packages.debian.org/search?keywords=fio
33
34Ubuntu:
35Starting with Ubuntu 10.04 LTS (aka "Lucid Lynx"), fio packages are part
36of the Ubuntu "universe" repository.
37http://packages.ubuntu.com/search?keywords=fio
38
39SUSE:
40Pascal Bleser <guru@unixtech.be> has fio RPMs in his repository for SUSE
41variants, you can find them here:
1053a106
JA
42http://linux01.gwdg.de/~pbleser/rpm-navigation.php?cat=System/fio
43
d85b1add 44Red Hat, CentOS & Co:
a68594cb 45Dag Wieërs has RPMs for Red Hat related distros, find them here:
a68594cb
JA
46http://dag.wieers.com/rpm/packages/fio/
47
d85b1add 48Mandriva:
244e170e
JA
49Mandriva has integrated fio into their package repository, so installing
50on that distro should be as easy as typing 'urpmi fio'.
51
d85b1add
SK
52Solaris:
53Packages for Solaris are available from OpenCSW. Install their pkgutil
54tool (http://www.opencsw.org/get-it/pkgutil/) and then install fio via
55'pkgutil -i fio'.
56
2b02b546 57
726f6ff0
JA
58Mailing list
59------------
60
61There's a mailing list associated with fio. It's meant for general
2e8552b0
JA
62discussion, bug reporting, questions, and development - basically anything
63that has to do with fio. An automated mail detailing recent commits is
64automatically sent to the list at most daily. The list address is
65fio@vger.kernel.org, subscribe by sending an email to
66majordomo@vger.kernel.org with
67
68subscribe fio
69
4f5d1526
EIB
70in the body of the email. Archives can be found here:
71
72http://www.spinics.net/lists/fio/
73
74and archives for the old list can be found here:
2e8552b0
JA
75
76http://maillist.kernel.dk/fio-devel/
726f6ff0
JA
77
78
bbfd6b00
JA
79Building
80--------
81
82Just type 'make' and 'make install'. If on FreeBSD, for now you have to
5f421004 83specify the FreeBSD Makefile with -f and use gmake (not make), eg:
bbfd6b00 84
5f421004 85$ gmake -f Makefile.Freebsd && gmake -f Makefile.FreeBSD install
bbfd6b00 86
bf2e821a
CC
87Same goes for AIX:
88
89$ gmake -f Makefile.aix && gmake -f Makefile.aix install
90
edffcb96 91Likewise with OpenSolaris, use the Makefile.solaris to compile there.
5f421004
JA
92The OpenSolaris make should work fine. This might change in the
93future if I opt for an autoconf type setup.
bbfd6b00 94
6de43c1b
JA
95If your compile fails with an error like this:
96
97 CC gettime.o
98In file included from fio.h:23,
99 from gettime.c:8:
100os/os.h:15:20: error: libaio.h: No such file or directory
101In file included from gettime.c:8:
102fio.h:119: error: field 'iocb' has incomplete type
103make: *** [gettime.o] Error 1
104
105Check that you have the libaio development package installed. On RPM
106based distros, it's typically called libaio-devel.
107
bbfd6b00 108
972cfd25
JA
109Command line
110------------
ebac4655
JA
111
112$ fio
ee56ad50 113 --debug Enable some debugging options (see below)
b4692828 114 --output Write output to file
062c6022 115 --timeout Runtime in seconds
b4692828
JA
116 --latency-log Generate per-job latency logs
117 --bandwidth-log Generate per-job bandwidth logs
118 --minimal Minimal (terse) output
119 --version Print version info and exit
fd28ca49
JA
120 --help Print this page
121 --cmdhelp=cmd Print command help, "all" for all of them
cca73aa7 122 --showcmd Turn a job file into command line options
062c6022 123 --readonly Turn on safety read-only checks, preventing writes
e592a06b
AC
124 --eta=when When ETA estimate should be printed
125 May be "always", "never" or "auto"
01f06b63 126 --section=name Only run specified section in job file
2b386d25 127 --alloc-size=kb Set smalloc pool to this size in kb (def 1024)
e592a06b 128
b4692828
JA
129
130Any parameters following the options will be assumed to be job files,
131unless they match a job file parameter. You can add as many as you want,
132each job file will be regarded as a separate group and fio will stonewall
133its execution.
972cfd25 134
724e4435
JA
135The --readonly switch is an extra safety guard to prevent accidentically
136turning on a write setting when that is not desired. Fio will only write
137if rw=write/randwrite/rw/randrw is given, but this extra safety net can
138be used as an extra precaution. It will also enable a write check in the
139io engine core to prevent an accidental write due to a fio bug.
140
ee56ad50
JA
141The debug switch allows adding options that trigger certain logging
142options in fio. Currently the options are:
143
144 process Dump info related to processes
145 file Dump info related to file actions
146 io Dump info related to IO queuing
147 mem Dump info related to memory allocations
bd6f78b2
JA
148 blktrace Dump info related to blktrace setup
149 verify Dump info related to IO verification
150 all Enable all debug options
811a0d06 151 random Dump info related to random offset generation
a3d741fa 152 parse Dump info related to option matching and parsing
cd991b9e 153 diskutil Dump info related to disk utilization updates
5e1d306e 154 job:x Dump info only related to job number x
29adda3c 155 mutex Dump info only related to mutex up/down ops
c223da83
JA
156 profile Dump info related to profile extensions
157 time Dump info related to internal time keeping
bd6f78b2 158 ? or help Show available debug options.
ee56ad50
JA
159
160You can specify as many as you want, eg --debug=file,mem will enable
bd6f78b2 161file and memory debugging.
ee56ad50 162
01f06b63
JA
163The section switch is meant to make it easier to ship a bigger job file
164instead of several smaller ones. Say you define a job file with light,
165moderate, and heavy parts. Then you can ask fio to run the given part
166only by giving it a --section=heavy command line option. The section
167option only applies to job sections, the reserved 'global' section is
168always parsed and taken into account.
169
2b386d25
JA
170Fio has an internal allocator for shared memory called smalloc. It
171allocates shared structures from this pool. The pool defaults to 1024k
931823ca 172in size, and can grow to 128 pools. If running large jobs with randommap
2b386d25 173enabled it can run out of memory, in which case the --alloc-size switch
931823ca
JA
174is handy for starting with a larger pool size. The backing store is
175files in /tmp. Fio cleans up after itself, while it is running you
176may see .fio_smalloc.* files in /tmp.
2b386d25 177
79809113
JA
178
179Job file
180--------
181
71bfa161 182See the HOWTO file for a more detailed description of parameters and what
4661f3d0
JA
183they mean. This file contains the terse version. You can describe big and
184complex setups with the command line, but generally it's a lot easier to
71bfa161 185just write a simple job file to describe the workload. The job file format
4661f3d0 186is in the ini style format, as that is easy to read and write for the user.
79809113
JA
187
188The job file parameters are:
ebac4655 189
01452055 190 name=x Use 'x' as the identifier for this job.
61697c37 191 description=x 'x' is a text description of the job.
ebac4655 192 directory=x Use 'x' as the top level directory for storing files
b50b8755
JA
193 filename=x Force the use of 'x' as the filename for all files
194 in this thread. If not given, fio will make up
195 a suitable filename based on the thread and file
196 number.
3d60d1ed
JA
197 rw=x 'x' may be: read, randread, write, randwrite,
198 rw (read-write mix), randrw (read-write random mix)
a6ccc7be
JA
199 rwmixcycle=x Base cycle for switching between read and write
200 in msecs.
201 rwmixread=x 'x' percentage of rw mix ios will be reads. If
202 rwmixwrite is also given, the last of the two will
203 be used if they don't add up to 100%.
204 rwmixwrite=x 'x' percentage of rw mix ios will be writes. See
205 rwmixread.
9ebc27e1
JA
206 rand_repeatable=x The sequence of random io blocks can be repeatable
207 across runs, if 'x' is 1.
ebac4655
JA
208 size=x Set file size to x bytes (x string can include k/m/g)
209 ioengine=x 'x' may be: aio/libaio/linuxaio for Linux aio,
78e7b3e7
JA
210 posixaio for POSIX aio, solarisaio for Solaris
211 native async IO, sync for regular read/write io,
1d2af02a
JA
212 psync for regular pread/pwrite io, vsync for regular
213 readv/writev (with queuing emulation) mmap for mmap'ed
214 io, syslet-rw for syslet driven read/write, splice for
d0c70934
GP
215 using splice/vmsplice, sg for direct SG_IO io, net
216 for network io, or cpuio for a cycler burner load. sg
1d2af02a
JA
217 only works on Linux on SCSI (or SCSI-like devices, such
218 as usb-storage or sata/libata driven) devices. Fio also
219 has a null io engine, which is mainly used for testing
220 fio itself.
221
ebac4655
JA
222 iodepth=x For async io, allow 'x' ios in flight
223 overwrite=x If 'x', layout a write file first.
53cdc686
JA
224 nrfiles=x Spread io load over 'x' number of files per job,
225 if possible.
ebac4655
JA
226 prio=x Run io at prio X, 0-7 is the kernel allowed range
227 prioclass=x Run io at prio class X
228 bs=x Use 'x' for thread blocksize. May include k/m postfix.
229 bsrange=x-y Mix thread block sizes randomly between x and y. May
230 also include k/m postfix.
231 direct=x 1 for direct IO, 0 for buffered IO
232 thinktime=x "Think" x usec after each io
b22989b9
JA
233 rate=x Throttle rate to x KB/sec
234 ratemin=x Quit if rate of x KB/sec can't be met
ebac4655
JA
235 ratecycle=x ratemin averaged over x msecs
236 cpumask=x Only allow job to run on CPUs defined by mask.
d2e268b0 237 cpus_allowed=x Like 'cpumask', but allow text setting of CPU affinity.
795407ca
JA
238 fsync=x If writing with buffered IO, fsync after every
239 'x' blocks have been written.
240 end_fsync=x If 'x', run fsync() after end-of-job.
ebac4655 241 startdelay=x Start this thread x seconds after startup
03b74b3e 242 runtime=x Terminate x seconds after startup. Can include a
906c8d75
JA
243 normal time suffix if not given in seconds, such as
244 'm' for minutes, 'h' for hours, and 'd' for days.
ebac4655
JA
245 offset=x Start io at offset x (x string can include k/m/g)
246 invalidate=x Invalidate page cache for file prior to doing io
795407ca 247 sync=x Use sync writes if x and writing buffered IO.
ebac4655 248 mem=x If x == malloc, use malloc for buffers. If x == shm,
795407ca
JA
249 use shared memory for buffers. If x == mmap, use
250 anonymous mmap.
ebac4655
JA
251 exitall When one thread quits, terminate the others
252 bwavgtime=x Average bandwidth stats over an x msec window.
253 create_serialize=x If 'x', serialize file creation.
254 create_fsync=x If 'x', run fsync() after file creation.
f6cbb269 255 unlink If set, unlink files when done.
ebac4655
JA
256 loops=x Run the job 'x' number of times.
257 verify=x If 'x' == md5, use md5 for verifies. If 'x' == crc32,
258 use crc32 for verifies. md5 is 'safer', but crc32 is
259 a lot faster. Only makes sense for writing to a file.
bac39e0e 260 For other types of checksumming, see HOWTO.
ebac4655
JA
261 stonewall Wait for preceeding jobs to end before running.
262 numjobs=x Create 'x' similar entries for this job
263 thread Use pthreads instead of forked jobs
20dc95c4
JA
264 zonesize=x
265 zoneskip=y Zone options must be paired. If given, the job
266 will skip y bytes for every x read/written. This
267 can be used to gauge hard drive speed over the entire
268 platter, without reading everything. Both x/y can
269 include k/m/g suffix.
aea47d44
JA
270 iolog=x Open and read io pattern from file 'x'. The file must
271 contain one io action per line in the following format:
272 rw, offset, length
273 where with rw=0/1 for read/write, and the offset
274 and length entries being in bytes.
843a7413
JA
275 write_iolog=x Write an iolog to file 'x' in the same format as iolog.
276 The iolog options are exclusive, if both given the
5b42a488
SH
277 read iolog will be performed. Specify a separate file
278 for each job, otherwise the iologs will be interspersed
279 and the file may be corrupt.
ec94ec56
JA
280 write_bw_log Write a bandwidth log.
281 write_lat_log Write a latency log.
c04f7ec3
JA
282 lockmem=x Lock down x amount of memory on the machine, to
283 simulate a machine with less memory available. x can
284 include k/m/g suffix.
b6f4d880 285 nice=x Run job at given nice value.
4e0ba8af
JA
286 exec_prerun=x Run 'x' before job io is begun.
287 exec_postrun=x Run 'x' after job io has finished.
da86774e 288 ioscheduler=x Use ioscheduler 'x' for this job.
b990b5c0
JA
289 cpuload=x For a CPU io thread, percentage of CPU time to attempt
290 to burn.
ba0fbe10 291 cpuchunks=x Split burn cycles into pieces of x usecs.
ebac4655 292
79809113 293
217bc04b
JA
294
295Platforms
296---------
297
04924a11
JA
298Fio works on (at least) Linux, Solaris, AIX, OSX, NetBSD, and FreeBSD. Some
299features and/or options may only be available on some of the platforms,
300typically because those features only apply to that platform (like the
301solarisaio engine, or the splice engine on Linux).
217bc04b
JA
302
303Some features are not available on FreeBSD/Solaris even if they could be
304implemented, I'd be happy to take patches for that. An example of that is
305disk utility statistics and (I think) huge page support, support for that
306does exist in FreeBSD/Solaris.
307
308Fio uses pthread mutexes for signalling and locking and FreeBSD does not
309support process shared pthread mutexes. As a result, only threads are
310supported on FreeBSD. This could be fixed with sysv ipc locking or
311other locking alternatives.
312
313Other *BSD platforms are untested, but fio should work there almost out
314of the box. Since I don't do test runs or even compiles on those platforms,
315your mileage may vary. Sending me patches for other platforms is greatly
316appreciated. There's a lot of value in having the same test/benchmark tool
317available on all platforms.
318
bf2e821a
CC
319Note that POSIX aio is not enabled by default on AIX. If you get messages like:
320
321 Symbol resolution failed for /usr/lib/libc.a(posix_aio.o) because:
322 Symbol _posix_kaio_rdwr (number 2) is not exported from dependent module /unix.
323
324you need to enable POSIX aio. Run the following commands as root:
325
326 # lsdev -C -l posix_aio0
327 posix_aio0 Defined Posix Asynchronous I/O
328 # cfgmgr -l posix_aio0
329 # lsdev -C -l posix_aio0
330 posix_aio0 Available Posix Asynchronous I/O
331
332POSIX aio should work now. To make the change permanent:
333
334 # chdev -l posix_aio0 -P -a autoconfig='available'
335 posix_aio0 changed
217bc04b
JA
336
337
79809113
JA
338Author
339------
340
aae22ca7 341Fio was written by Jens Axboe <axboe@kernel.dk> to enable flexible testing
79809113
JA
342of the Linux IO subsystem and schedulers. He got tired of writing
343specific test applications to simulate a given workload, and found that
344the existing io benchmark/test tools out there weren't flexible enough
345to do what he wanted.
346
aae22ca7 347Jens Axboe <axboe@kernel.dk> 20060905
79809113 348