FS#8383 - Unzip and non-latin encodings

Attached to Project: Arch Linux
Opened by Artyom (urandom) - Tuesday, 23 October 2007, 13:50 GMT
Last edited by Roman Kyrylych (Romashka) - Sunday, 18 November 2007, 13:58 GMT
Task Type Bug Report
Category Packages: Extra
Status Closed
Assigned To Roman Kyrylych (Romashka)
Dan McGee (toofishes)
Architecture All
Severity Low
Priority Normal
Reported Version 2007.08-2
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 2
Private No

Details

Unzipping archives created with different locale than system, provide strange encoding. Alt Linux team for example use unzip with iconv patch.
This task depends upon

Closed by  Roman Kyrylych (Romashka)
Sunday, 18 November 2007, 13:58 GMT
Reason for closing:  Fixed
Comment by Dan McGee (toofishes) - Tuesday, 23 October 2007, 17:50 GMT
If you would post a link to this patch and any discussion about it, it would be appreciated.
Comment by Artyom (urandom) - Wednesday, 24 October 2007, 19:53 GMT
Item in Alt Linux bugtracker about unzip: https://bugzilla.altlinux.org/show_bug.cgi?id=4871
Iconv patch with utf8 support: https://bugzilla.altlinux.org/attachment.cgi?id=1402
All talks about this bug in Russian only (Alt is Russian distributive)
Comment by Roman Kyrylych (Romashka) - Thursday, 25 October 2007, 10:17 GMT
Ok, I've read comments on that report.
The patch seems ok to me.
It adds two options for charset conversion + it autoguesses input & output charsets (Russian and Ukrainian only) which is useful for software where you cannot specify conversion options manually.
Artyom, can you confirm that it does autoguessing for UTF-8 locale too?
Comment by Artyom (urandom) - Thursday, 25 October 2007, 10:28 GMT
Yes, unzipping archive created on windows provide correct conversion in ru_RU.UTF-8 with this patch
Comment by Roman Kyrylych (Romashka) - Friday, 16 November 2007, 14:40 GMT
I've uploaded new unzip-5.52-3 to Testing. Please test.
Comment by Artyom (urandom) - Friday, 16 November 2007, 17:15 GMT
Good, work rightly for ru_RU.UTF-8.
Comment by Dan McGee (toofishes) - Friday, 16 November 2007, 17:39 GMT
Thanks Roman, I feel better with you doing this patch since I can't read Russian.

Just make sure it works in some non-Russian locales fine too (C, POSIX, en_US, etc).

Loading...