Project

General

Profile

Statistics
| Revision:

# Date Author Comment
53691 09/11/2018 03:50 PM Claudio Atzori

Kaggle/Reactome: added configurable params

53688 09/11/2018 02:20 PM Giorgos Papanikos

added configurable producer timeout

53685 09/11/2018 01:50 PM Giorgos Papanikos

Added safeguard parsing of produced xml to catch badly escaped illegal characters (eg etf: \001 escaped as ). Changed harvesting thread execution from ThreadExecutor to Thread.Start

53683 09/11/2018 11:45 AM Claudio Atzori

Kaggle/Reactome: factored out file write procedure

53677 08/11/2018 05:29 PM Giorgos Papanikos

added more logging and removed fair option from blocking queue

53664 08/11/2018 11:29 AM Giorgos Papanikos

Updatged string identifier type reference to use enum

53663 08/11/2018 11:21 AM Claudio Atzori

added main classes to verify the content collected from Kaggle and Reactome

53660 08/11/2018 10:41 AM Giorgos Papanikos
53641 06/11/2018 06:49 PM Giorgos Papanikos

Added default empty dataset document serialization for endpoints where no dataset can be retrieved

53616 04/11/2018 03:23 PM Giorgos Papanikos

Corrected Creation Date format

53615 04/11/2018 02:33 PM Giorgos Papanikos

deleted dead code. shouldn't have been commited in the first place

53614 04/11/2018 02:23 PM Giorgos Papanikos

Added schema.org harvesting plugin. Supports sitemapindex files and api listing calls to retrieve endpoints list