code formatting
fixed class name for logs
Issue Enhancement #3858 and code cleanup in RestIterator
Kaggle/Reactome: added configurable params
added configurable producer timeout
Added safeguard parsing of produced xml to catch badly escaped illegal characters (eg etf: \001 escaped as ). Changed harvesting thread execution from ThreadExecutor to Thread.Start
Kaggle/Reactome: factored out file write procedure
added more logging and removed fair option from blocking queue
Updatged string identifier type reference to use enum
added main classes to verify the content collected from Kaggle and Reactome
fixed issue
fixed issue for stopping iteration execution
Added default empty dataset document serialization for endpoints where no dataset can be retrieved
optional parameters for the schema.org plugin
added all the possible parameters to the Schema.org ProtocolDescriptor
Corrected Creation Date format
deleted dead code. shouldn't have been commited in the first place
Added schema.org harvesting plugin. Supports sitemapindex files and api listing calls to retrieve endpoints list
enhancement of new resumptionType, Issue Enhancement #3858
fix JSON replacement with cleanUnwantedJsonCharsInXmlTagnames
use XmlCleaner for cleaning up XML results andprepare for next revision.
org.json.XML - update maven package version to 20180813better unicode support
add fix related to #3849 (control characters in xml files break the transformation)
org.json.XML - Workaround for JSON element names -> XML tagnames.remove resumptionParam&-Type from first 'query' URL.
Additional to discover option in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Additional comments, debugging output in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Additional comments, debugging output and small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Small changes in the Rest_Json CollectorPlugin for the enhancements of the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
Changes in the Rest_Json CollectorPlugin with enhancements for the new OpenDOAR API at JISC under https://v2.sherpa.ac.uk/opendoar/
use HttpConnector to download XML instead of VTDGen parse URL method
remove "\n" from all the cell contents
fix issue in input data
minor
changes in the implementation of the iterator
little adjustment to fix data format in input data
cleanup
fix for package name (HTTPWithFileName -> httpfilename and fixed issue on iterator for HTTPWithFileNameCollectorIterable
small refactor
used blocking methods in HTTPWithFileNameCollectorIterable
added information to the associated URL for junk metadata
fixed issue when .jos metadata extension contain xml content
code cleaning
changed implementation of data gathering
some logs for debugging reasons added
stupid mistake
check for malformed json
remove DOCTYPE from metadata xml
filtering metadata and added param in template to specify what to filter out
removed DOCTYPE from xml metadata document
change in the parameters of httpWithFilename plugin
modified update of information in xml metadata
considered the case metadata are given in xml format instead of json
commit after refactoring
added bean
fix bug
pluging for collecting metadata from files mapped to urls (related to #3236)
using commons-lang3
small adjustments in rest json plugin
fixed 'next' method of the iterator class and added new field entityPath
implemented missing method collect in RestCollectorPlugin; added test class; removed RestIteratorFactory class
declaring bean for RestCollectorPlugin
collector plugin for rest apis
Moved HttpConnector in common package
Using HttpConnector in re3data plugin
integrated latest changes from dnet40
codebase used to migrate to java8 the production system