Merge tag 'ubifs-for-linus-6.4-rc1' of git://git.kernel.org/pub/scm/linux/kernel...
[linux-block.git] / Documentation / filesystems / tmpfs.rst
CommitLineData
7e7cd458
MCC
1.. SPDX-License-Identifier: GPL-2.0
2
3=====
4Tmpfs
5=====
6
f38d58b7 7Tmpfs is a file system which keeps all of its files in virtual memory.
1da177e4
LT
8
9
10Everything in tmpfs is temporary in the sense that no files will be
11created on your hard drive. If you unmount a tmpfs instance,
12everything stored therein is lost.
13
14tmpfs puts everything into the kernel internal caches and grows and
15shrinks to accommodate the files it contains and is able to swap
2c6efe9c
LC
16unneeded pages out to swap space, if swap was enabled for the tmpfs
17mount. tmpfs also supports THP.
d0f5a854
LC
18
19tmpfs extends ramfs with a few userspace configurable options listed and
20explained further below, some of which can be reconfigured dynamically on the
21fly using a remount ('mount -o remount ...') of the filesystem. A tmpfs
22filesystem can be resized but it cannot be resized to a size below its current
23usage. tmpfs also supports POSIX ACLs, and extended attributes for the
24trusted.* and security.* namespaces. ramfs does not use swap and you cannot
25modify any parameter for a ramfs filesystem. The size limit of a ramfs
26filesystem is how much memory you have available, and so care must be taken if
27used so to not run out of memory.
28
29An alternative to tmpfs and ramfs is to use brd to create RAM disks
30(/dev/ram*), which allows you to simulate a block device disk in physical RAM.
31To write data you would just then need to create an regular filesystem on top
32this ramdisk. As with ramfs, brd ramdisks cannot swap. brd ramdisks are also
33configured in size at initialization and you cannot dynamically resize them.
34Contrary to brd ramdisks, tmpfs has its own filesystem, it does not rely on the
35block layer at all.
1da177e4 36
2c6efe9c
LC
37Since tmpfs lives completely in the page cache and optionally on swap,
38all tmpfs pages will be shown as "Shmem" in /proc/meminfo and "Shared" in
0bc126d4
RF
39free(1). Notice that these counters also include shared memory
40(shmem, see ipcs(1)). The most reliable way to get the count is
41using df(1) and du(1).
1da177e4
LT
42
43tmpfs has the following uses:
44
451) There is always a kernel internal mount which you will not see at
46 all. This is used for shared anonymous mappings and SYSV shared
7e7cd458 47 memory.
1da177e4
LT
48
49 This mount does not depend on CONFIG_TMPFS. If CONFIG_TMPFS is not
f38d58b7 50 set, the user visible part of tmpfs is not built. But the internal
1da177e4
LT
51 mechanisms are always present.
52
532) glibc 2.2 and above expects tmpfs to be mounted at /dev/shm for
54 POSIX shared memory (shm_open, shm_unlink). Adding the following
7e7cd458 55 line to /etc/fstab should take care of this::
1da177e4
LT
56
57 tmpfs /dev/shm tmpfs defaults 0 0
58
59 Remember to create the directory that you intend to mount tmpfs on
bf6ee0ae 60 if necessary.
1da177e4
LT
61
62 This mount is _not_ needed for SYSV shared memory. The internal
63 mount is used for that. (In the 2.3 kernel versions it was
64 necessary to mount the predecessor of tmpfs (shm fs) to use SYSV
f38d58b7 65 shared memory.)
1da177e4
LT
66
673) Some people (including me) find it very convenient to mount it
68 e.g. on /tmp and /var/tmp and have a big swap partition. And now
69 loop mounts of tmpfs files do work, so mkinitrd shipped by most
70 distributions should succeed with a tmpfs /tmp.
71
724) And probably a lot more I do not know about :-)
73
74
75tmpfs has three mount options for sizing:
76
7e7cd458
MCC
77========= ============================================================
78size The limit of allocated bytes for this tmpfs instance. The
1da177e4
LT
79 default is half of your physical RAM without swap. If you
80 oversize your tmpfs instances the machine will deadlock
81 since the OOM handler will not be able to free that memory.
7e7cd458
MCC
82nr_blocks The same as size, but in blocks of PAGE_SIZE.
83nr_inodes The maximum number of inodes for this instance. The default
1da177e4 84 is half of the number of your physical RAM pages, or (on a
670e9f34 85 machine with highmem) the number of lowmem RAM pages,
1da177e4 86 whichever is the lower.
2c6efe9c
LC
87noswap Disables swap. Remounts must respect the original settings.
88 By default swap is enabled.
7e7cd458 89========= ============================================================
1da177e4
LT
90
91These parameters accept a suffix k, m or g for kilo, mega and giga and
92can be changed on remount. The size parameter also accepts a suffix %
93to limit this tmpfs instance to that percentage of your physical RAM:
94the default, when neither size nor nr_blocks is specified, is size=50%
95
0edd73b3
HD
96If nr_blocks=0 (or size=0), blocks will not be limited in that instance;
97if nr_inodes=0, inodes will not be limited. It is generally unwise to
1da177e4
LT
98mount with such options, since it allows any user with write access to
99use up all the memory on the machine; but enhances the scalability of
f38d58b7 100that instance in a system with many CPUs making intensive use of it.
1da177e4 101
d0f5a854
LC
102tmpfs also supports Transparent Huge Pages which requires a kernel
103configured with CONFIG_TRANSPARENT_HUGEPAGE and with huge supported for
104your system (has_transparent_hugepage(), which is architecture specific).
105The mount options for this are:
106
107====== ============================================================
108huge=0 never: disables huge pages for the mount
109huge=1 always: enables huge pages for the mount
110huge=2 within_size: only allocate huge pages if the page will be
111 fully within i_size, also respect fadvise()/madvise() hints.
112huge=3 advise: only allocate huge pages if requested with
113 fadvise()/madvise()
114====== ============================================================
115
116There is a sysfs file which you can also use to control system wide THP
117configuration for all tmpfs mounts, the file is:
118
119/sys/kernel/mm/transparent_hugepage/shmem_enabled
120
121This sysfs file is placed on top of THP sysfs directory and so is registered
122by THP code. It is however only used to control all tmpfs mounts with one
123single knob. Since it controls all tmpfs mounts it should only be used either
124for emergency or testing purposes. The values you can set for shmem_enabled are:
125
126== ============================================================
127-1 deny: disables huge on shm_mnt and all mounts, for
128 emergency use
129-2 force: enables huge on shm_mnt and all mounts, w/o needing
130 option, for testing
131== ============================================================
1da177e4 132
7339ff83 133tmpfs has a mount option to set the NUMA memory allocation policy for
b00dc3ad
HD
134all files in that instance (if CONFIG_NUMA is enabled) - which can be
135adjusted on the fly via 'mount -o remount ...'
7339ff83 136
7e7cd458 137======================== ==============================================
55741696
KM
138mpol=default use the process allocation policy
139 (see set_mempolicy(2))
b00dc3ad
HD
140mpol=prefer:Node prefers to allocate memory from the given Node
141mpol=bind:NodeList allocates memory only from nodes in NodeList
142mpol=interleave prefers to allocate from each node in turn
143mpol=interleave:NodeList allocates from each node of NodeList in turn
55741696 144mpol=local prefers to allocate memory from the local node
7e7cd458 145======================== ==============================================
b00dc3ad
HD
146
147NodeList format is a comma-separated list of decimal numbers and ranges,
148a range being two hyphen-separated decimal numbers, the smallest and
149largest node numbers in the range. For example, mpol=bind:0-3,5,7,9-15
7339ff83 150
971ada0f
LS
151A memory policy with a valid NodeList will be saved, as specified, for
152use at file creation time. When a task allocates a file in the file
153system, the mount option memory policy will be applied with a NodeList,
154if any, modified by the calling task's cpuset constraints
7e7cd458
MCC
155[See Documentation/admin-guide/cgroup-v1/cpusets.rst] and any optional flags,
156listed below. If the resulting NodeLists is the empty set, the effective
157memory policy for the file will revert to "default" policy.
971ada0f 158
65d66fc0
DR
159NUMA memory allocation policies have optional flags that can be used in
160conjunction with their modes. These optional flags can be specified
161when tmpfs is mounted by appending them to the mode before the NodeList.
3ecf53e4
MR
162See Documentation/admin-guide/mm/numa_memory_policy.rst for a list of
163all available memory allocation policy mode flags and their effect on
164memory policy.
65d66fc0 165
7e7cd458
MCC
166::
167
65d66fc0
DR
168 =static is equivalent to MPOL_F_STATIC_NODES
169 =relative is equivalent to MPOL_F_RELATIVE_NODES
170
171For example, mpol=bind=static:NodeList, is the equivalent of an
172allocation policy of MPOL_BIND | MPOL_F_STATIC_NODES.
173
ad329b15
HD
174Note that trying to mount a tmpfs with an mpol option will fail if the
175running kernel does not support NUMA; and will fail if its nodelist
a210906c
HD
176specifies a node which is not online. If your system relies on that
177tmpfs being mounted, but from time to time runs a kernel built without
178NUMA capability (perhaps a safe recovery kernel), or with fewer nodes
179online, then it is advisable to omit the mpol option from automatic
ad329b15
HD
180mount options. It can be added later, when the tmpfs is already mounted
181on MountPoint, by 'mount -o remount,mpol=Policy:NodeList MountPoint'.
182
7339ff83 183
1da177e4
LT
184To specify the initial root directory you can use the following mount
185options:
186
7e7cd458
MCC
187==== ==================================
188mode The permissions as an octal number
189uid The user id
190gid The group id
191==== ==================================
1da177e4
LT
192
193These options do not have any effect on remount. You can change these
194parameters with chmod(1), chown(1) and chgrp(1) on a mounted filesystem.
195
196
ea3271f7
CD
197tmpfs has a mount option to select whether it will wrap at 32- or 64-bit inode
198numbers:
199
200======= ========================
201inode64 Use 64-bit inode numbers
202inode32 Use 32-bit inode numbers
203======= ========================
204
205On a 32-bit kernel, inode32 is implicit, and inode64 is refused at mount time.
206On a 64-bit kernel, CONFIG_TMPFS_INODE64 sets the default. inode64 avoids the
207possibility of multiple files with the same inode number on a single device;
208but risks glibc failing with EOVERFLOW once 33-bit inode numbers are reached -
209if a long-lived tmpfs is accessed by 32-bit applications so ancient that
210opening a file larger than 2GiB fails with EINVAL.
211
212
1da177e4
LT
213So 'mount -t tmpfs -o size=10G,nr_inodes=10k,mode=700 tmpfs /mytmpfs'
214will give you tmpfs instance on /mytmpfs which can allocate 10GB
215RAM/SWAP in 10240 inodes and it is only accessible by root.
216
217
7e7cd458 218:Author:
1da177e4 219 Christoph Rohland <cr@sap.com>, 1.12.01
7e7cd458 220:Updated:
98f32602 221 Hugh Dickins, 4 June 2007
7e7cd458 222:Updated:
55741696 223 KOSAKI Motohiro, 16 Mar 2010
ea3271f7
CD
224:Updated:
225 Chris Down, 13 July 2020