Fix failure to exit IO loop on some IO sizes
[fio.git] / README
CommitLineData
ebac4655
JA
1fio
2---
3
79809113
JA
4fio is a tool that will spawn a number of threads or processes doing a
5particular type of io action as specified by the user. fio takes a
6number of global parameters, each inherited by the thread unless
7otherwise parameters given to them overriding that setting is given.
8The typical use of fio is to write a job file matching the io load
9one wants to simulate.
ebac4655 10
2b02b546
JA
11
12Source
13------
14
15fio resides in a git repo, the canonical place is:
16
6b3eccb1 17git://git.kernel.dk/fio.git
97f049c9 18
a9bac3f9
JA
19If you are inside a corporate firewall, git:// may not always work for
20you. In that case you can use the http protocol, path is the same:
21
22http://git.kernel.dk/fio.git
2b02b546 23
79809113
JA
24Snapshots are frequently generated and they include the git meta data as
25well. You can download them here:
2b02b546
JA
26
27http://brick.kernel.dk/snaps/
28
1053a106 29
d85b1add
SK
30Binary packages
31---------------
32
33Debian:
34Starting with Debian "Squeeze", fio packages are part of the official
35Debian repository. http://packages.debian.org/search?keywords=fio
36
37Ubuntu:
38Starting with Ubuntu 10.04 LTS (aka "Lucid Lynx"), fio packages are part
39of the Ubuntu "universe" repository.
40http://packages.ubuntu.com/search?keywords=fio
41
42SUSE:
43Pascal Bleser <guru@unixtech.be> has fio RPMs in his repository for SUSE
44variants, you can find them here:
1053a106
JA
45http://linux01.gwdg.de/~pbleser/rpm-navigation.php?cat=System/fio
46
d85b1add 47Red Hat, CentOS & Co:
a68594cb 48Dag Wieërs has RPMs for Red Hat related distros, find them here:
a68594cb
JA
49http://dag.wieers.com/rpm/packages/fio/
50
d85b1add 51Mandriva:
244e170e
JA
52Mandriva has integrated fio into their package repository, so installing
53on that distro should be as easy as typing 'urpmi fio'.
54
d85b1add
SK
55Solaris:
56Packages for Solaris are available from OpenCSW. Install their pkgutil
57tool (http://www.opencsw.org/get-it/pkgutil/) and then install fio via
58'pkgutil -i fio'.
59
ecc314ba
BC
60Windows:
61Bruce Cran <bruce@cran.org.uk> has fio packages for Windows at
62http://www.bluestop.org/fio .
63
2b02b546 64
726f6ff0
JA
65Mailing list
66------------
67
68There's a mailing list associated with fio. It's meant for general
2e8552b0
JA
69discussion, bug reporting, questions, and development - basically anything
70that has to do with fio. An automated mail detailing recent commits is
71automatically sent to the list at most daily. The list address is
72fio@vger.kernel.org, subscribe by sending an email to
73majordomo@vger.kernel.org with
74
75subscribe fio
76
4f5d1526
EIB
77in the body of the email. Archives can be found here:
78
79http://www.spinics.net/lists/fio/
80
81and archives for the old list can be found here:
2e8552b0
JA
82
83http://maillist.kernel.dk/fio-devel/
726f6ff0
JA
84
85
bbfd6b00
JA
86Building
87--------
88
d015e398 89Just type 'make' and 'make install'.
bbfd6b00 90
d015e398
BC
91Note that GNU make is required. On BSD it's available from devel/gmake;
92on Solaris it's in the SUNWgmake package. On platforms where GNU make
93isn't the default, type 'gmake' instead of 'make'.
bbfd6b00 94
6de43c1b
JA
95If your compile fails with an error like this:
96
97 CC gettime.o
98In file included from fio.h:23,
99 from gettime.c:8:
100os/os.h:15:20: error: libaio.h: No such file or directory
101In file included from gettime.c:8:
102fio.h:119: error: field 'iocb' has incomplete type
103make: *** [gettime.o] Error 1
104
105Check that you have the libaio development package installed. On RPM
106based distros, it's typically called libaio-devel.
107
bbfd6b00 108
53adf64f
BC
109Windows
110-------
111
f41862f7
BC
112On Windows Cygwin (http://www.cygwin.com/) is required in order to
113build fio. To create an MSI installer package install WiX 3.7 from
114http://wixtoolset.org and run dobuild.cmd from the
93bcfd20 115os/windows directory.
53adf64f 116
f41862f7
BC
117How to compile FIO on 64-bit Windows:
118
119 1. Install Cygwin (http://www.cygwin.com/setup.exe). Install 'make' and all
120 packages starting with 'mingw64-i686' and 'mingw64-x86_64'.
121 2. Download ftp://sourceware.org/pub/pthreads-win32/prebuilt-dll-2-9-1-release/dll/x64/pthreadGC2.dll
122 and copy to the fio source directory.
123 3. Open the Cygwin Terminal.
124 4. Go to the fio directory (source files).
125 5. Run 'make clean'.
126 6. Run 'make'.
444310ff 127
53adf64f 128
972cfd25
JA
129Command line
130------------
ebac4655
JA
131
132$ fio
1cfd036f
BC
133 --debug Enable some debugging options (see below)
134 --output Write output to file
b2cecdc2 135 --runtime Runtime in seconds
bebe6398
JA
136 --latency-log Generate per-job latency logs
137 --bandwidth-log Generate per-job bandwidth logs
1cfd036f 138 --minimal Minimal (terse) output
f3afa57e 139 --output-format=type Output format (terse,json,normal)
3449ab8c 140 --terse-version=type Terse version output format (default 3, or 2 or 4).
f3afa57e 141 --version Print version info and exit
1cfd036f 142 --help Print this page
23893646 143 --cpuclock-test Perform test/validation of CPU clock
bebe6398 144 --cmdhelp=cmd Print command help, "all" for all of them
de890a1e
SL
145 --enghelp=engine Print ioengine help, or list available ioengines
146 --enghelp=engine,cmd Print help for an ioengine cmd
1cfd036f 147 --showcmd Turn a job file into command line options
ad0a2735 148 --readonly Turn on safety read-only checks, preventing
bebe6398 149 writes
1cfd036f 150 --eta=when When ETA estimate should be printed
bebe6398
JA
151 May be "always", "never" or "auto"
152 --section=name Only run specified section in job file.
153 Multiple sections can be specified.
e7cb819b 154 --alloc-size=kb Set smalloc pool to this size in kb (def 1024)
155 --warnings-fatal Fio parser warnings are fatal
fca70358 156 --max-jobs Maximum number of threads/processes to support
bebe6398
JA
157 --server=args Start backend server. See Client/Server section.
158 --client=host Connect to specified backend.
f2a2ce0e
HL
159 --idle-prof=option Report cpu idleness on a system or percpu basis
160 (option=system,percpu) or run unit work
161 calibration only (option=calibrate).
e592a06b 162
b4692828
JA
163
164Any parameters following the options will be assumed to be job files,
165unless they match a job file parameter. You can add as many as you want,
166each job file will be regarded as a separate group and fio will stonewall
167its execution.
972cfd25 168
ecc314ba 169The --readonly switch is an extra safety guard to prevent accidentally
724e4435
JA
170turning on a write setting when that is not desired. Fio will only write
171if rw=write/randwrite/rw/randrw is given, but this extra safety net can
172be used as an extra precaution. It will also enable a write check in the
173io engine core to prevent an accidental write due to a fio bug.
174
ee56ad50
JA
175The debug switch allows adding options that trigger certain logging
176options in fio. Currently the options are:
177
178 process Dump info related to processes
179 file Dump info related to file actions
e7cb819b 180 io Dump info related to IO queuing
181 mem Dump info related to memory allocations
bd6f78b2
JA
182 blktrace Dump info related to blktrace setup
183 verify Dump info related to IO verification
e7cb819b 184 all Enable all debug options
811a0d06 185 random Dump info related to random offset generation
a3d741fa 186 parse Dump info related to option matching and parsing
cd991b9e 187 diskutil Dump info related to disk utilization updates
5e1d306e 188 job:x Dump info only related to job number x
29adda3c 189 mutex Dump info only related to mutex up/down ops
c223da83
JA
190 profile Dump info related to profile extensions
191 time Dump info related to internal time keeping
bd6f78b2 192 ? or help Show available debug options.
ee56ad50
JA
193
194You can specify as many as you want, eg --debug=file,mem will enable
bd6f78b2 195file and memory debugging.
ee56ad50 196
01f06b63
JA
197The section switch is meant to make it easier to ship a bigger job file
198instead of several smaller ones. Say you define a job file with light,
199moderate, and heavy parts. Then you can ask fio to run the given part
200only by giving it a --section=heavy command line option. The section
201option only applies to job sections, the reserved 'global' section is
202always parsed and taken into account.
203
2b386d25
JA
204Fio has an internal allocator for shared memory called smalloc. It
205allocates shared structures from this pool. The pool defaults to 1024k
931823ca 206in size, and can grow to 128 pools. If running large jobs with randommap
2b386d25 207enabled it can run out of memory, in which case the --alloc-size switch
931823ca
JA
208is handy for starting with a larger pool size. The backing store is
209files in /tmp. Fio cleans up after itself, while it is running you
210may see .fio_smalloc.* files in /tmp.
2b386d25 211
79809113
JA
212
213Job file
214--------
215
71bfa161 216See the HOWTO file for a more detailed description of parameters and what
4661f3d0
JA
217they mean. This file contains the terse version. You can describe big and
218complex setups with the command line, but generally it's a lot easier to
71bfa161 219just write a simple job file to describe the workload. The job file format
4661f3d0 220is in the ini style format, as that is easy to read and write for the user.
79809113
JA
221
222The job file parameters are:
ebac4655 223
01452055 224 name=x Use 'x' as the identifier for this job.
61697c37 225 description=x 'x' is a text description of the job.
ebac4655 226 directory=x Use 'x' as the top level directory for storing files
b50b8755
JA
227 filename=x Force the use of 'x' as the filename for all files
228 in this thread. If not given, fio will make up
229 a suitable filename based on the thread and file
230 number.
3d60d1ed
JA
231 rw=x 'x' may be: read, randread, write, randwrite,
232 rw (read-write mix), randrw (read-write random mix)
a6ccc7be
JA
233 rwmixcycle=x Base cycle for switching between read and write
234 in msecs.
235 rwmixread=x 'x' percentage of rw mix ios will be reads. If
236 rwmixwrite is also given, the last of the two will
237 be used if they don't add up to 100%.
238 rwmixwrite=x 'x' percentage of rw mix ios will be writes. See
239 rwmixread.
9ebc27e1
JA
240 rand_repeatable=x The sequence of random io blocks can be repeatable
241 across runs, if 'x' is 1.
ebac4655
JA
242 size=x Set file size to x bytes (x string can include k/m/g)
243 ioengine=x 'x' may be: aio/libaio/linuxaio for Linux aio,
78e7b3e7 244 posixaio for POSIX aio, solarisaio for Solaris
03e20d68
BC
245 native async IO, windowsaio for Windows native async IO,
246 sync for regular read/write io,
1d2af02a
JA
247 psync for regular pread/pwrite io, vsync for regular
248 readv/writev (with queuing emulation) mmap for mmap'ed
249 io, syslet-rw for syslet driven read/write, splice for
d0c70934 250 using splice/vmsplice, sg for direct SG_IO io, net
d0b937ed
YR
251 for network io, rdma for RDMA io, or cpuio for a
252 cycler burner load. sg only works on Linux on
253 SCSI (or SCSI-like devices, such as usb-storage or
254 sata/libata driven) devices. Fio also has a null
255 io engine, which is mainly used for testing
1d2af02a
JA
256 fio itself.
257
ebac4655
JA
258 iodepth=x For async io, allow 'x' ios in flight
259 overwrite=x If 'x', layout a write file first.
53cdc686
JA
260 nrfiles=x Spread io load over 'x' number of files per job,
261 if possible.
ebac4655
JA
262 prio=x Run io at prio X, 0-7 is the kernel allowed range
263 prioclass=x Run io at prio class X
264 bs=x Use 'x' for thread blocksize. May include k/m postfix.
265 bsrange=x-y Mix thread block sizes randomly between x and y. May
266 also include k/m postfix.
267 direct=x 1 for direct IO, 0 for buffered IO
268 thinktime=x "Think" x usec after each io
b22989b9
JA
269 rate=x Throttle rate to x KB/sec
270 ratemin=x Quit if rate of x KB/sec can't be met
ebac4655
JA
271 ratecycle=x ratemin averaged over x msecs
272 cpumask=x Only allow job to run on CPUs defined by mask.
d2e268b0 273 cpus_allowed=x Like 'cpumask', but allow text setting of CPU affinity.
d0b937ed
YR
274 numa_cpu_nodes=x,y-z Allow job to run on specified NUMA nodes' CPU.
275 numa_mem_policy=m:x,y-z Setup numa memory allocation policy.
276 'm' stands for policy, such as local, interleave,
277 bind, prefer, local. 'x, y-z' are numa node(s) for
278 memory allocation according to policy.
795407ca
JA
279 fsync=x If writing with buffered IO, fsync after every
280 'x' blocks have been written.
281 end_fsync=x If 'x', run fsync() after end-of-job.
ebac4655 282 startdelay=x Start this thread x seconds after startup
03b74b3e 283 runtime=x Terminate x seconds after startup. Can include a
906c8d75
JA
284 normal time suffix if not given in seconds, such as
285 'm' for minutes, 'h' for hours, and 'd' for days.
ebac4655
JA
286 offset=x Start io at offset x (x string can include k/m/g)
287 invalidate=x Invalidate page cache for file prior to doing io
795407ca 288 sync=x Use sync writes if x and writing buffered IO.
ebac4655 289 mem=x If x == malloc, use malloc for buffers. If x == shm,
795407ca
JA
290 use shared memory for buffers. If x == mmap, use
291 anonymous mmap.
ebac4655
JA
292 exitall When one thread quits, terminate the others
293 bwavgtime=x Average bandwidth stats over an x msec window.
294 create_serialize=x If 'x', serialize file creation.
295 create_fsync=x If 'x', run fsync() after file creation.
f6cbb269 296 unlink If set, unlink files when done.
ebac4655
JA
297 loops=x Run the job 'x' number of times.
298 verify=x If 'x' == md5, use md5 for verifies. If 'x' == crc32,
299 use crc32 for verifies. md5 is 'safer', but crc32 is
300 a lot faster. Only makes sense for writing to a file.
bac39e0e 301 For other types of checksumming, see HOWTO.
ebac4655
JA
302 stonewall Wait for preceeding jobs to end before running.
303 numjobs=x Create 'x' similar entries for this job
304 thread Use pthreads instead of forked jobs
20dc95c4
JA
305 zonesize=x
306 zoneskip=y Zone options must be paired. If given, the job
307 will skip y bytes for every x read/written. This
308 can be used to gauge hard drive speed over the entire
309 platter, without reading everything. Both x/y can
310 include k/m/g suffix.
25c8b9d7
PD
311 read_iolog=x Open and read io pattern from file 'x'. The file format
312 is described in the HOWTO.
843a7413
JA
313 write_iolog=x Write an iolog to file 'x' in the same format as iolog.
314 The iolog options are exclusive, if both given the
5b42a488
SH
315 read iolog will be performed. Specify a separate file
316 for each job, otherwise the iologs will be interspersed
317 and the file may be corrupt.
ec94ec56
JA
318 write_bw_log Write a bandwidth log.
319 write_lat_log Write a latency log.
c04f7ec3
JA
320 lockmem=x Lock down x amount of memory on the machine, to
321 simulate a machine with less memory available. x can
322 include k/m/g suffix.
b6f4d880 323 nice=x Run job at given nice value.
4e0ba8af
JA
324 exec_prerun=x Run 'x' before job io is begun.
325 exec_postrun=x Run 'x' after job io has finished.
da86774e 326 ioscheduler=x Use ioscheduler 'x' for this job.
b990b5c0
JA
327 cpuload=x For a CPU io thread, percentage of CPU time to attempt
328 to burn.
ba0fbe10 329 cpuchunks=x Split burn cycles into pieces of x usecs.
ebac4655 330
79809113 331
217bc04b 332
bebe6398
JA
333Client/server
334------------
335
336Normally you would run fio as a stand-alone application on the machine
337where the IO workload should be generated. However, it is also possible to
338run the frontend and backend of fio separately. This makes it possible to
339have a fio server running on the machine(s) where the IO workload should
340be running, while controlling it from another machine.
341
342To start the server, you would do:
343
344fio --server=args
345
346on that machine, where args defines what fio listens to. The arguments
811826be
JA
347are of the form 'type,hostname or IP,port'. 'type' is either 'ip' (or ip4)
348for TCP/IP v4, 'ip6' for TCP/IP v6, or 'sock' for a local unix domain socket.
349'hostname' is either a hostname or IP address, and 'port' is the port to
350listen to (only valid for TCP/IP, not a local socket). Some examples:
bebe6398
JA
351
3521) fio --server
353
354 Start a fio server, listening on all interfaces on the default port (8765).
355
811826be 3562) fio --server=ip:hostname,4444
bebe6398
JA
357
358 Start a fio server, listening on IP belonging to hostname and on port 4444.
359
811826be
JA
3603) fio --server=ip6:::1,4444
361
362 Start a fio server, listening on IPv6 localhost ::1 and on port 4444.
363
3644) fio --server=,4444
bebe6398
JA
365
366 Start a fio server, listening on all interfaces on port 4444.
367
811826be 3685) fio --server=1.2.3.4
bebe6398
JA
369
370 Start a fio server, listening on IP 1.2.3.4 on the default port.
371
811826be 3726) fio --server=sock:/tmp/fio.sock
bebe6398
JA
373
374 Start a fio server, listening on the local socket /tmp/fio.sock.
375
376When a server is running, you can connect to it from a client. The client
377is run with:
378
379fio --local-args --client=server --remote-args <job file(s)>
380
381where --local-args are arguments that are local to the client where it is
382running, 'server' is the connect string, and --remote-args and <job file(s)>
383are sent to the server. The 'server' string follows the same format as it
384does on the server side, to allow IP/hostname/socket and port strings.
385You can connect to multiple clients as well, to do that you could run:
386
a7321eed 387fio --client=server2 <job file(s)> --client=server2 <job file(s)>
bebe6398
JA
388
389
217bc04b
JA
390Platforms
391---------
392
ce600ac9
JA
393Fio works on (at least) Linux, Solaris, AIX, HP-UX, OSX, NetBSD, Windows
394and FreeBSD. Some features and/or options may only be available on some of
395the platforms, typically because those features only apply to that platform
396(like the solarisaio engine, or the splice engine on Linux).
217bc04b
JA
397
398Some features are not available on FreeBSD/Solaris even if they could be
399implemented, I'd be happy to take patches for that. An example of that is
400disk utility statistics and (I think) huge page support, support for that
401does exist in FreeBSD/Solaris.
402
403Fio uses pthread mutexes for signalling and locking and FreeBSD does not
404support process shared pthread mutexes. As a result, only threads are
405supported on FreeBSD. This could be fixed with sysv ipc locking or
406other locking alternatives.
407
408Other *BSD platforms are untested, but fio should work there almost out
409of the box. Since I don't do test runs or even compiles on those platforms,
410your mileage may vary. Sending me patches for other platforms is greatly
411appreciated. There's a lot of value in having the same test/benchmark tool
412available on all platforms.
413
bf2e821a
CC
414Note that POSIX aio is not enabled by default on AIX. If you get messages like:
415
416 Symbol resolution failed for /usr/lib/libc.a(posix_aio.o) because:
417 Symbol _posix_kaio_rdwr (number 2) is not exported from dependent module /unix.
418
419you need to enable POSIX aio. Run the following commands as root:
420
421 # lsdev -C -l posix_aio0
422 posix_aio0 Defined Posix Asynchronous I/O
423 # cfgmgr -l posix_aio0
424 # lsdev -C -l posix_aio0
425 posix_aio0 Available Posix Asynchronous I/O
426
427POSIX aio should work now. To make the change permanent:
428
429 # chdev -l posix_aio0 -P -a autoconfig='available'
430 posix_aio0 changed
217bc04b
JA
431
432
79809113
JA
433Author
434------
435
aae22ca7 436Fio was written by Jens Axboe <axboe@kernel.dk> to enable flexible testing
79809113
JA
437of the Linux IO subsystem and schedulers. He got tired of writing
438specific test applications to simulate a given workload, and found that
439the existing io benchmark/test tools out there weren't flexible enough
440to do what he wanted.
441
aae22ca7 442Jens Axboe <axboe@kernel.dk> 20060905
79809113 443