Document Viewer

View, access and work on full documents.

  • The module is an extension of the IIIF Image Viewer;
  • No needs for any external plugins, content stay locally;
  • Hight quality visualization with minimal bandwidth use, mobile friendly;
  • "Search inside" feature with highlighting functionality;
  • and much more!

Screenshots

Description of the available code

The Document Viewer is an extension of the IIIF Image Viewer module (that is a prerequisite software). The Document Viewer allows the online visualization of PDF files in the browser, using only standard web features without the need for any external plugins, such as Flash Player, etc, and with minimal bandwidth use. The viewer is extendible to other format as detailed in the future functionalities section. No third party services are involved, both the original files and the web optimized files used by the viewer reside in the DSpace instance. The original file (PDF, etc.) is not downloaded by the browser, instead partial and resized image files are downloaded at the optimal resolution for the current device and zoom level.

A suite of curation tasks run to process each PDF page:

  • an image with configurable resolution is extracted, to balance quality and disk usage;
  • a text representation of the image is extracted, while preserving the positioning of data;
  • textual information is indexed with positions in the IIIF Search API.

The viewer prevents end users to copy and paste the content of the file, and downloading of the original PDF file can be avoided. The viewer provides a “search inside” feature with highlighting functionality for PDFs where text extraction is possible. Combining the Document Viewer module with the OCR module allows to exploit the “search inside” and the highlighting also for scanned (image) PDFs.

Take a quick look at it

Live demo

Check our services

Services

The new features we could develop with your support

  • EPUB format support;

  • Microsoft Word formats support; RTF format automatic conversion to PDF file;

  • TEI format support;

  • PostScript (PS) format support.

To access the code and start using the module: €3,000 (+ € 5,000 for the IIIF Image Module).
You can express your preference on the functionality you would like us to develop first.

Make IT open!

Target budget: €75,000
4%

We thank the following institution for their contribution:

Access & use: €3,000

Other modules:

CKAN Integration

Add Research Data Management features to your DSpace.

IIIF Image Viewer

Use an international standard to work with image collections.

OCR & Transcription

Search anything you need in your digitalized documents.

Video/Audio Streaming

Simplify access and reuse of audio/video content.

Other solutions