diff options
authorNishanth Menon <>2021-07-07 15:48:40 -0500
committerJonathan Corbet <>2021-07-12 09:56:50 -0600
commit40751c6c9bea6a5cfede7c61ee5f3cb1ab857029 (patch)
parente73f0f0ee7541171d89f2e2491130c7771ba58d3 (diff)
scripts/ Strictly read license files in utf-8
Commit bc41a7f36469 ("LICENSES: Add the CC-BY-4.0 license") unfortunately introduced LICENSES/dual/CC-BY-4.0 in UTF-8 Unicode text While python will barf at it with: FAIL: 'ascii' codec can't decode byte 0xe2 in position 2109: ordinal not in range(128) Traceback (most recent call last): File "scripts/", line 244, in <module> spdx = read_spdxdata(repo) File "scripts/", line 47, in read_spdxdata for l in open(el.path).readlines(): File "/usr/lib/python3.6/encodings/", line 26, in decode return codecs.ascii_decode(input, self.errors)[0] UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 2109: ordinal not in range(128) While it is indeed debatable if 'Licensor.' used in the license file needs unicode quotes, instead, force spdxcheck to read utf-8. Reported-by: Rahul T R <> Signed-off-by: Nishanth Menon <> Reviewed-by: Thomas Gleixner <> Link: Signed-off-by: Jonathan Corbet <>
1 files changed, 1 insertions, 1 deletions
diff --git a/scripts/ b/scripts/
index 3e784cf9f401..ebd06ae642c9 100755
--- a/scripts/
+++ b/scripts/
@@ -44,7 +44,7 @@ def read_spdxdata(repo):
exception = None
- for l in open(el.path).readlines():
+ for l in open(el.path, encoding="utf-8").readlines():
if l.startswith('Valid-License-Identifier:'):
lid = l.split(':')[1].strip().upper()
if lid in spdx.licenses: