Commit Graph

84 Commits

Author SHA1 Message Date
jendib
e72fe3683c #4: Upgrade to unrelease PDFBox 2 2015-09-05 23:12:01 +02:00
jendib
44c10b60cd File update log is useless 2015-09-05 20:06:21 +02:00
jendib
467d14bacb Closes #24: Change to H2 database + indexes tweaks + queries tweaks
Tested up to 100k documents
2015-09-05 12:36:01 +02:00
jendib
6d73554967 #24: High performance is not going to happen on HSQLDB 2015-09-02 01:12:33 +02:00
jendib
9c97ab14f8 Catch all Tesseract related errors 2015-08-29 01:20:06 +02:00
jendib
08e4f6ddae Closes #14: Soft delete on DocumentTag + audit log ordering 2015-08-28 01:02:33 +02:00
jendib
f8d889bb1f Closes #22: incorrect composite ID for DocumentTag 2015-08-24 22:18:47 +02:00
jendib
6add34bb33 #20: Display logs on documents 2015-05-23 19:16:38 +02:00
jendib
ea4e3fd8f2 #20: Audit log displayed on main screen 2015-05-17 22:20:34 +02:00
jendib
b2a38cea62 Closes #21: Save IP and UA on login 2015-05-15 17:30:21 +02:00
jendib
b7f920f864 Native query for GET /document/id 2015-05-10 13:45:39 +02:00
jendib
52387d93ac Closes #13: Don't show tags from other users 2015-05-09 21:52:01 +02:00
jendib
42320dc9b9 #13: Fix performance issue 2015-05-09 18:00:03 +02:00
jendib
ff994ce63b #13: Disable shared status in GET /document/list (too slow) 2015-05-09 16:48:01 +02:00
jendib
fc1bb22d8d #13: ACL system 2015-05-09 14:44:19 +02:00
jendib
c283607063 Don't crash if a file is deleted before OCR is completed 2015-04-29 01:28:42 +02:00
Walter
192c2030d3 Dockerization + Fix for Tesseract 3.03 2015-03-11 00:35:42 +01:00
jendib
18cedaef2c Orphan files are linked to a specific user 2015-03-06 22:40:33 +01:00
jendib
d0c259ead2 List orphan files 2015-03-06 21:23:50 +01:00
jendib
2347483676 Order of files attached to document 2015-03-06 21:13:09 +01:00
jendib
6c976087de Missing file + TODO 2015-03-03 00:26:40 +01:00
jendib
c36014b46f Ability to upload files without document (no OCR, no Lucene)
+ New resource to attach a document to a file and OCR/Lucene it
2015-03-03 00:23:30 +01:00
jendib
34e3ac5478 Download all files from a document as ZIP 2014-02-23 14:09:41 +01:00
jendib
77f0368ba5 Hardwire TIFFImageReaderSpi to avoid registering bug (again) 2013-11-07 01:12:11 +01:00
jendib
d2ba291287 Change package for tess4j 2013-10-01 22:24:12 +02:00
jendib
726121d8c8 Fix Hibernate entity 2013-09-05 21:43:45 +02:00
jendib
22cea20a90 Hard coupling between tess4j and imageIO to avoid service registering 2013-09-05 16:10:26 +02:00
jendib
fc53758eb7 Typo, TODO 2013-09-03 18:03:44 +02:00
jendib
f5079e83cb Typo 2013-09-03 09:19:47 +02:00
jendib
b399e4081f Typo 2013-09-02 23:28:38 +02:00
jendib
ac3580fb4a Fix leak 2013-08-26 22:03:14 +02:00
jendib
870a44da0d Return file count on GET /document/list 2013-08-22 17:59:24 +02:00
jendib
db7a9f0e4a Encrypt stored files in SHA 256 2013-08-20 21:51:07 +02:00
jendib
906de329ae DB update script 6 2013-08-20 18:55:49 +02:00
jendib
00b00f0d0c File encryption utilities 2013-08-20 18:06:08 +02:00
jendib
0bc658a396 More loading feedback (client) 2013-08-20 00:57:22 +02:00
jendib
464d43194b File encryption (in progress) 2013-08-19 23:57:50 +02:00
jendib
2289f523c4 Close all PDDocument 2013-08-18 14:25:53 +02:00
jendib
2c7083aa43 Force loading of corrupted PDF 2013-08-18 14:11:08 +02:00
jendib
d3f59554f8 Index and generate thumbnails from PDF 2013-08-18 13:48:08 +02:00
jendib
4f8076427b Search criterias on full content and specific date 2013-08-18 13:02:50 +02:00
jendib
fd95ecc4cb Batch to regenerate all file variations 2013-08-18 02:12:48 +02:00
jendib
dab6f4b9d1 Intermediate thumbnail size more web-friendly 2013-08-18 01:26:34 +02:00
jendib
6b5c1b2b51 Delete files from storage when necessary,
batch to cleanup storage for orphan files,
better Lucene directory reader management
2013-08-18 00:53:01 +02:00
jendib
00ed2e3c25 Close IndexReader after search 2013-08-17 23:16:19 +02:00
jendib
5451c79674 Fix OCR all files batch 2013-08-17 16:59:16 +02:00
jendib
e2bd85da5f OCR all files in database asynchronously 2013-08-17 16:54:53 +02:00
jendib
234eaf047e Add logs before OCR 2013-08-17 15:47:07 +02:00
jendib
8378b346e9 Null check on Lucene documents, fix documents search query 2013-08-17 15:18:42 +02:00
jendib
7ed976b27a Index files OCR-ized content and documents, search on index fields 2013-08-17 14:16:55 +02:00