unicode: Recreate utf8_parse_version()
authorAndré Almeida <andrealmeid@igalia.com>
Mon, 21 Oct 2024 16:37:20 +0000 (13:37 -0300)
committerChristian Brauner <brauner@kernel.org>
Mon, 28 Oct 2024 12:36:54 +0000 (13:36 +0100)
commit142fa60f61f93805471012f24e029af6d113c5cc
tree594b205b6cc47af0c7545f3c320a7b6216928e46
parent04dad6c6d37d741bad9946a92171bfa637e989f0
unicode: Recreate utf8_parse_version()

All filesystems that currently support UTF-8 casefold can fetch the
UTF-8 version from the filesystem metadata stored on disk. They can get
the data stored and directly match it to a integer, so they can skip the
string parsing step, which motivated the removal of this function in the
first place.

However, for tmpfs, the only way to tell the kernel which UTF-8 version
we are about to use is via mount options, using a string. Re-introduce
utf8_parse_version() to be used by tmpfs.

This version differs from the original by skipping the intermediate step
of copying the version string to an auxiliary string before calling
match_token(). This versions calls match_token() in the argument string.
The paramenters are simpler now as well.

utf8_parse_version() was created by 9d53690f0d4 ("unicode: implement
higher level API for string handling") and later removed by 49bd03cc7e9
("unicode: pass a UNICODE_AGE() tripple to utf8_load").

Signed-off-by: André Almeida <andrealmeid@igalia.com>
Link: https://lore.kernel.org/r/20241021-tonyk-tmpfs-v8-4-f443d5814194@igalia.com
Reviewed-by: Theodore Ts'o <tytso@mit.edu>
Reviewed-by: Gabriel Krisman Bertazi <krisman@suse.de>
Signed-off-by: Christian Brauner <brauner@kernel.org>
fs/unicode/utf8-core.c
include/linux/unicode.h