FS#31808 - [tesseract] missing dependency on tesseract data files

Attached to Project: Community Packages
Opened by Greg (dolby) - Thursday, 04 October 2012, 21:23 GMT
Last edited by Sergej Pupykin (sergej) - Thursday, 15 November 2012, 11:15 GMT
Task Type Bug Report
Category Packages
Status Closed
Assigned To Sergej Pupykin (sergej)
Architecture All
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Tesseract is missing dependency on its split data files.
I dont know the best way to achieve this, IMO this split is uneeded, but maybe it could depend on the data files the same way libreoffice depends on its i10n files or maybe even add tesseract-data-$something as an optdepend so that the user is informed about the existence of the data files without which tesseract is useless.

Additionally it would be nice if the data files got proper descriptions about which languages they add support for.
eg. tesseract-data-fra's description is: Tesseract OCR data (fra)
That should be changed to Tesseract OCR data (French)
This task depends upon

Closed by  Sergej Pupykin (sergej)
Thursday, 15 November 2012, 11:15 GMT
Reason for closing:  Fixed
Comment by Greg (dolby) - Thursday, 04 October 2012, 21:29 GMT
"That should be changed to Tesseract OCR data (French)"

Or maybe something like "Tesseract OCR French data files"
Comment by Sergej Pupykin (sergej) - Friday, 26 October 2012, 13:44 GMT
I added message into post_install and post_upgrade about installing data files.
> "You must install one of tesseract-data-* packages or whole tesseract-data group"

About fixing description: I think I would not implement it, because of technical reason.
Description is generated automatically from data filename:
pkgdesc=\"Tesseract OCR data ($l)\"
Comment by Greg (dolby) - Wednesday, 14 November 2012, 23:54 GMT
  • Field changed: Percent Complete (100% → 0%)
Post install messages arent used for dependency tracking.
Comment by Sergej Pupykin (sergej) - Thursday, 15 November 2012, 04:48 GMT
Ok, I can pack all data file into single package with executable.
Comment by Sergej Pupykin (sergej) - Thursday, 15 November 2012, 04:49 GMT
But I'd rather to leave it as is. data-files are in optdeps and install message say that you must install at least one of them.
Comment by Greg (dolby) - Thursday, 15 November 2012, 05:52 GMT
Hi Sergej, you had already fixed this dependency issue in svn [0] without pushing even at the time i opened this report so feel free to close this.
I didnt realise this myself before requesting the reopen and you didnt write about it here yourself either.

[0]: https://projects.archlinux.org/svntogit/community.git/commit/trunk?h=packages/tesseract&id=f1c9c2129ee9bc4202479a74e74687f3ef7391d7

Comment by Greg (dolby) - Thursday, 15 November 2012, 05:59 GMT
To add to the above since all the data files are now optdepends we could do without the post install message which this bug report mistakenly caused to be added.

https://projects.archlinux.org/svntogit/community.git/commit/trunk?h=packages/tesseract&id=dfba9eb34a88f3a65ad1b7b4d9a95e182af3f4d4

Loading...