Commit Graph

14 Commits

Author | SHA1 | Message | Date
jendib | 7f19f8c112 | #62: Migration to Lucene 5 (without rebuilding old index) | 2016-03-01 01:01:10 +01:00
jendib | a0f309c957 | Upgrade libraries | 2016-01-29 01:55:59 +01:00
jendib | 3172a5f216 | Closes #59: Use TwelveMonkeys' ImageIO plugin for JPEG | 2016-01-24 15:44:40 +01:00
jendib | 1a37d97a61 | #53: Handle and extract text content from DOCX and ODT files | 2015-12-07 23:53:30 +01:00
jendib | cd32f452e9 | Closes #38: Handle JBIG2 images in PDF | 2015-11-01 18:10:16 +01:00
jendib | 0fe51d355c | Closes #29: Upgrade to Jersey 2 | 2015-09-07 21:51:13 +02:00
jendib | 97694d5d59 | Closes #26: Cleanup Maven dependencies | 2015-09-06 15:21:20 +02:00
jendib | 467d14bacb | Closes #24: Change to H2 database + indexes tweaks + queries tweaks. Tested up to 100k documents | 2015-09-05 12:36:01 +02:00
jendib | 22cea20a90 | Hard coupling between tess4j and imageIO to avoid service registering | 2013-09-05 16:10:26 +02:00
jendib | 464d43194b | File encryption (in progress) | 2013-08-19 23:57:50 +02:00
jendib | d3f59554f8 | Index and generate thumbnails from PDF | 2013-08-18 13:48:08 +02:00
jendib | 1f1f02ed41 | Document language (server), OCR files and store result in database | 2013-08-16 23:48:35 +02:00
jendib | 471933ca8c | PDF handling, file upload progression | 2013-07-28 18:29:03 +02:00
jendib | 9b74bd8194 | Initial commit | 2013-07-27 18:33:20 +02:00