Community Packages

Please read this before reporting a bug:
https://wiki.archlinux.org/title/Bug_reporting_guidelines

Do NOT report bugs when a package is just outdated, or it is in the AUR. Use the 'flag out of date' link on the package page, or the Mailing List.

REPEAT: Do NOT report bugs for outdated packages!
Tasklist

FS#22309 - [tracker] cannot extract text from OpenDocument files

Attached to Project: Community Packages
Opened by Matthias Dienstbier (fs4000) - Tuesday, 04 January 2011, 15:38 GMT
Last edited by Andrea Scarpino (BaSh) - Tuesday, 08 March 2011, 18:26 GMT
Task Type Bug Report
Category Packages
Status Closed
Assigned To Sergej Pupykin (sergej)
Architecture All
Severity Medium
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description:
Meta Tracker cannot index OASIS OpenDocument files because "odt2txt" is missing.

This can be verified by creating an *.odt file and looking into ~/.local/share/tracker/tracker-extract.log:

04 Jan 2011, 16:22:11: Tracker-Warning **: Spawning failed, could not extract text from '/home/matthias/test.odt': Kindprozess »odt2txt« konnte nicht ausgeführt werden (Datei oder Verzeichnis nicht gefunden)

We have odt2txt in AUR: http://aur.archlinux.org/packages.php?ID=9856

But I don't know if it's worth taking it into [community] since the development version dropped the dependency in version 0.9.17: http://git.gnome.org/browse/tracker/commit/?id=9a3c8cfc2cd0b6b4bb8420c244551dc139bf80f6
https://bugzilla.gnome.org/show_bug.cgi?id=623005

Perhaps we simply can use the new plugin with a patch.
This task depends upon

Closed by  Andrea Scarpino (BaSh)
Tuesday, 08 March 2011, 18:26 GMT
Reason for closing:  Fixed
Additional comments about closing:  tracker 0.10-1
Comment by Matthias Dienstbier (fs4000) - Tuesday, 04 January 2011, 22:04 GMT
I created a patch to include the new parser for OpenDocument from upstream, but it seems to do not parse very well. For me simple text still doesn't get indexed, just headings, tables, etc. I think it doesn't conform to the ODF standard.

EDIT: simplified patch
Comment by Matthias Dienstbier (fs4000) - Wednesday, 05 January 2011, 18:26 GMT
Another fix: I think the evolution plugin can be enabled again with that line:
sed -i '/#include <camel\/camel-db.h>/d' src/plugins/evolution/tracker-evolution-plugin.c

I wasn't able to find any mail via Tracker, but the plugin compiles and Tracker seems to be indexing the mails.
Comment by Jelle van der Waa (jelly) - Monday, 10 January 2011, 13:48 GMT
"Tracker plugin for odt, odp, ods files. "odt2txt" functionality is replaced
with the xml file extraction functionality."

Since upstream moved to xml, we will too. I will look into adding xml support to tracker and evolution support
Comment by Jelle van der Waa (jelly) - Monday, 10 January 2011, 15:37 GMT
i have build tracker with evolution mining support, I will first test this

Loading...