Task #10061

Make digitalobject:extract-text task also index transcripts in the ES index

Added by Mike Gale almost 6 years ago.

Status: New
Start date: 06/22/2016
Priority: Medium
Due date:
Assignee: -
% Done: 0%
Category: Digital object
Target version: -
Google Code Legacy ID:
Tested version: 2.3, 2.4
Sponsored: No
Requires documentation:

Description

Currently, the extract-text task only saves the transcript text it extracts from PDFs to the database. I think it would make sense for the task to also index each extracted transcript in Elasticsearch.
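
To make the request concrete, here is a minimal sketch of the proposed flow. It is written in Python for illustration only and is not the task's actual code. `FakeDatabase`, `FakeSearchIndex`, `extract_text`, and the `transcript` field name are all hypothetical stand-ins, and no real Elasticsearch client is used.

```python
from dataclasses import dataclass, field


@dataclass
class FakeDatabase:
    """Hypothetical stand-in for the database storage of transcripts."""
    transcripts: dict = field(default_factory=dict)

    def save_transcript(self, object_id, text):
        self.transcripts[object_id] = text


@dataclass
class FakeSearchIndex:
    """Hypothetical stand-in for the search index (not a real ES client)."""
    documents: dict = field(default_factory=dict)

    def update_document(self, object_id, fields):
        # Partial update: merge the new fields into any existing document.
        self.documents.setdefault(object_id, {}).update(fields)


def extract_text(extracted, db, index):
    """Store each extracted transcript, then index it as well.

    `extracted` maps a digital object id to the text pulled from its PDF.
    Returns the number of transcripts processed.
    """
    for object_id, text in extracted.items():
        db.save_transcript(object_id, text)  # what the task does today
        index.update_document(object_id, {"transcript": text})  # proposed step
    return len(extracted)


db, index = FakeDatabase(), FakeSearchIndex()
extract_text({42: "Minutes of the meeting"}, db, index)
```

With both writes in the same loop, the database and the index would receive the same transcript in one pass of the task.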
