X-Git-Url: https://git.kernel.dk/?a=blobdiff_plain;f=fio.1;h=d64fbb7ab5e0832f61f167d7fe2b20872f9ec2c1;hb=0bc27b0b7019e4c386f83258430fb6b3ac34cc06;hp=c58e81769d3585c09a808c057ed6a361ab034f3f;hpb=b26317c97b8bcca7cc9144ea6e1d4ed753f2311a;p=fio.git diff --git a/fio.1 b/fio.1 index c58e8176..d64fbb7a 100644 --- a/fio.1 +++ b/fio.1 @@ -478,7 +478,15 @@ size, fio can alternate random and zeroed data throughout the IO buffer. If set, fio will fill the IO buffers with this pattern. If not set, the contents of IO buffers is defined by the other options related to buffer contents. The setting can be any pattern of bytes, and can be prefixed with 0x for hex -values. +values. It may also be a string, where the string must then be wrapped with +"". +.TP +.BI dedupe_percentage \fR=\fPint +If set, fio will generate this percentage of identical buffers when writing. +These buffers will be naturally dedupable. The contents of the buffers depend +on what other buffer compression settings have been set. It's possible to have +the individual buffers either fully compressible, or not at all. This option +only controls the distribution of unique buffers. .TP .BI nrfiles \fR=\fPint Number of files to use for this job. Default: 1. @@ -612,6 +620,17 @@ options. Using Glusterfs libgfapi async interface to direct access to Glusterfs volumes without having to go through FUSE. This ioengine defines engine specific options. +.TP +.B libhdfs +Read and write through Hadoop (HDFS). The \fBfilename\fR option is used to +specify host,port of the hdfs name-node to connect. This engine interprets +offsets a little differently. In HDFS, files once created cannot be modified. +So random writes are not possible. To imitate this, libhdfs engine expects +bunch of small files to be created over HDFS, and engine will randomly pick a +file out of those files based on the offset generated by fio backend. (see the +example job file to create such files, use rw=write option). Please note, you +might want to set necessary environment variables to work with hdfs/libhdfs +properly. .RE .P .RE @@ -657,17 +676,20 @@ Offset in the file to start I/O. Data before the offset will not be touched. .TP .BI offset_increment \fR=\fPint If this is provided, then the real offset becomes the -offset + offset_increment * thread_number, where the thread number is a counter -that starts at 0 and is incremented for each job. This option is useful if -there are several jobs which are intended to operate on a file in parallel in -disjoint segments, with even spacing between the starting points. +offset + offset_increment * thread_number, where the thread number is a +counter that starts at 0 and is incremented for each sub-job (i.e. when +numjobs option is specified). This option is useful if there are several jobs +which are intended to operate on a file in parallel disjoint segments, with +even spacing between the starting points. .TP .BI number_ios \fR=\fPint Fio will normally perform IOs until it has exhausted the size of the region set by \fBsize\fR, or if it exhaust the allocated time (or hits an error condition). With this setting, the range/size can be set independently of the number of IOs to perform. When fio reaches this number, it will exit -normally and report status. +normally and report status. Note that this does not extend the amount +of IO that will be done, it will only stop fio if this condition is met +before other end-of-job criteria. .TP .BI fsync \fR=\fPint How many I/Os to perform before issuing an \fBfsync\fR\|(2) of dirty data. If @@ -1191,17 +1213,21 @@ If given, write a bandwidth log of the jobs in this job file. Can be used to store data of the bandwidth of the jobs in their lifetime. The included fio_generate_plots script uses gnuplot to turn these text files into nice graphs. See \fBwrite_lat_log\fR for behaviour of given filename. For this -option, the postfix is _bw.log. +option, the postfix is _bw.x.log, where x is the index of the job (1..N, +where N is the number of jobs) .TP .BI write_lat_log \fR=\fPstr Same as \fBwrite_bw_log\fR, but writes I/O completion latencies. If no -filename is given with this option, the default filename of "jobname_type.log" -is used. Even if the filename is given, fio will still append the type of log. +filename is given with this option, the default filename of +"jobname_type.x.log" is used, where x is the index of the job (1..N, where +N is the number of jobs). Even if the filename is given, fio will still +append the type of log. .TP .BI write_iops_log \fR=\fPstr Same as \fBwrite_bw_log\fR, but writes IOPS. If no filename is given with this -option, the default filename of "jobname_type.log" is used. Even if the -filename is given, fio will still append the type of log. +option, the default filename of "jobname_type.x.log" is used, where x is the +index of the job (1..N, where N is the number of jobs). Even if the filename +is given, fio will still append the type of log. .TP .BI log_avg_msec \fR=\fPint By default, fio will log an entry in the iops, latency, or bw log for every