wiki:mcgill_server_setup

Setup of McGill servers

images.rasi.mcgill.ca

images.rasi is now an alias for db.rasi.mcgill.ca. It uses a Letsencrypt SSL certificate installed using Certbot (https://certbot.eff.org).

IIPImage image server

The image server at https://images.rasi.mcgill.ca/fcgi-bin/iipsrv.fcgi now runs IIPImage using the FastCGI module of Apache 2.4.

The current version was taken from https://github.com/ruven/iipsrv (2017-10-16) and compiled with OpenJPEG 2.3.0 from https://github.com/uclouvain/openjpeg

Renamer

(Andrews email)

The transfers scripts for moving files from data3 to data1 are all online at https://github.com/ahankinson/renamer. It's basically just a set of steps that the images go through to get transformed into a standardized directory structure and format. It's written as a Django web application.

The heart of the project is here: https://github.com/ahankinson/renamer/tree/master/renamer/helpers

'to_archive' is what moves the files from data3 to data1. It does all sorts of data transformations, since the data is delivered to you in lots of different formats:

  • If it's a zip file it will uncompress it
  • If it's a PDF file it will extract the page images to lossless PNG

It will also perform a basic data integrity check by checksumming the files, and will store the originals so that they can be recovered if the conversion process goes wrong. It follows the MPIWG directory structure having pageimg and info directories, but also adds 'backup' for the originals.

'to_diva' is what moves the files from data1 to data7 for serving with the IIP Image Server (JPEG 2000). It uses the kakadu 'kdu_compress' script, and also generates a Diva-specific JSON file for doing the layout and formatting of the images in the viewer.

The rest of the web application is really just built around these scripts. It uses the 'celery' process manager to queue up the processing jobs, but other than that it's pretty straightforward.

Last modified 4 years ago Last modified on Oct 18, 2017, 1:20:07 PM