Commit Graph

21 Commits

Author SHA1 Message Date
jendib
1a37d97a61 #53: Handle and extract text content from DOCX and ODT files 2015-12-07 23:53:30 +01:00
jendib
0fab0e4fc0 RAM Lucene storage for JUnit + Surefire 2.18.1 forking mode 2015-11-29 20:22:24 +01:00
jendib
7e5aa9aecf Closes #44: Comments visible from share app
+ metadata-complete="true" in web.xml to skip annotations scanning
(second try with Jetty 9)
2015-11-21 20:31:21 +01:00
jendib
97252bb5da #32: Comments system (server side) 2015-11-16 02:22:51 +01:00
jendib
cd32f452e9 Closes #38: Handle JBIG2 images in PDF 2015-11-01 18:10:16 +01:00
jendib
0fe51d355c Closes #29: Upgrade to Jersey 2 2015-09-07 21:51:13 +02:00
jendib
97694d5d59 Closes #26: Cleanup Maven dependencies 2015-09-06 15:21:20 +02:00
jendib
e72fe3683c #4: Upgrade to unreleased PDFBox 2 2015-09-05 23:12:01 +02:00
jendib
467d14bacb Closes #24: Change to H2 database + indexes tweaks + queries tweaks
Tested up to 100k documents
2015-09-05 12:36:01 +02:00
jendib
6d73554967 #24: High performance is not going to happen on HSQLDB 2015-09-02 01:12:33 +02:00
jendib
6558ff7e05 tabs -> spaces 2015-08-29 00:14:47 +02:00
jendib
374310d13c Init stress app 2015-08-29 00:12:15 +02:00
jendib
07247854ac #4: Upgrade PDFBox 2015-03-24 22:20:54 +01:00
jendib
22cea20a90 Hard coupling between tess4j and imageIO to avoid service registering 2013-09-05 16:10:26 +02:00
jendib
464d43194b File encryption (in progress) 2013-08-19 23:57:50 +02:00
jendib
d3f59554f8 Index and generate thumbnails from PDF 2013-08-18 13:48:08 +02:00
jendib
1f1f02ed41 Document language (server), OCR files and store result in database 2013-08-16 23:48:35 +02:00
jendib
779c45a1e2 pom.xml to Java 7 2013-07-30 10:08:30 +02:00
jendib
cd97382f60 Sequential files upload, basic search system 2013-07-29 00:04:34 +02:00
jendib
471933ca8c PDF handling, file upload progression 2013-07-28 18:29:03 +02:00
jendib
9b74bd8194 Initial commit 2013-07-27 18:33:20 +02:00