Arch Linux

FS#25576 - [glibc] fi_FI has wrong collation for v, w, å, ä, ö.

Attached to Project: Arch Linux
Opened by Lauri Kenttä (LK) - Sunday, 14 August 2011, 17:02 GMT
Last edited by Allan McRae (Allan) - Monday, 15 August 2011, 00:19 GMT
Task Type: Bug Report
Category: Packages: Core
Status: Closed
Assigned To: Allan McRae (Allan)
Architecture: All
Severity: Low
Priority: Normal
Reported Version:
Due in Version: Undecided
Due Date: Undecided
Percent Complete: 100%
Votes: 0
Private: No

Details

LC_COLLATE=fi_FI treats some lower-case letters (v, w, å, ä, ö) as upper-case. This causes problems when grepping, for example.

Ubuntu has it right; maybe we should use their version? Patch attached.

Steps to reproduce:
1) Get yourself the fi_FI.UTF-8 locale.
2) Test:
export LC_COLLATE=fi_FI.UTF-8
echo v | grep -E '[u-x]' || echo - # expected: v, actual: -
echo v | grep -E '[U-X]' || echo - # expected: -, actual: v
echo w | grep -E '[u-x]' || echo - # expected: w, actual: -
echo w | grep -E '[U-X]' || echo - # expected: -, actual: w
echo ä | grep -E '[Å-Ö]' || echo - # expected: -, actual: ä
echo Ä | grep -E '[å-ö]' || echo - # expected: -, actual: Ä
echo abvcd | sed -r 's/[a-z]+/-/g' # expected: -, actual: -v-

Closed by Allan McRae (Allan) - Monday, 15 August 2011, 00:19 GMT
Reason for closing: Not a bug

Comment by Allan McRae (Allan) - Monday, 15 August 2011, 00:17 GMT
echo w | LC_COLLATE=fi_FI.UTF-8 grep -E '[[:lower:]]'
echo w | LC_COLLATE=fi_FI.UTF-8 grep -E '[[:upper:]]'

You need to learn regex...

Edit: well, not regex as such, but [u-x] is not guaranteed to mean "u, v, w, x" in every locale in grep.
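
A minimal sketch of the point above, assuming a POSIX shell with grep and sed that accept -E; the helper name matches_in_c is hypothetical and not part of the original report. Under LC_ALL=C a bracket range follows byte order, so [u-x] means exactly u, v, w, x, and the POSIX character classes express "lower-case" or "upper-case" without relying on collation order at all.

```shell
#!/bin/sh
# matches_in_c CHAR PATTERN (hypothetical helper):
# print CHAR if it matches PATTERN under the C locale, otherwise print "-".
matches_in_c() {
    printf '%s\n' "$1" | LC_ALL=C grep -E "$2" || echo -
}

# Ranges in the C locale follow byte order, so these behave as the report expected.
matches_in_c v '[u-x]'          # prints v
matches_in_c v '[U-X]'          # prints -

# Character classes state the intent directly instead of using a range.
matches_in_c w '[[:lower:]]'    # prints w
matches_in_c w '[[:upper:]]'    # prints -

# The same applies to sed: force the C locale for the range.
printf 'abvcd\n' | LC_ALL=C sed -E 's/[a-z]+/-/g'    # prints -
```

Forcing LC_ALL=C only suits ASCII ranges; for letters such as å, ä, ö the character classes are the safer choice, since they do not depend on where the locale sorts those letters.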