# Date Author Comment
39089 08/09/2015 03:09 PM Marek Horst

renaming test resources to be compliant with windows file system naming requirements

39057 05/09/2015 09:42 PM Marek Horst

fixing destination id in expected citation record

39056 05/09/2015 09:22 PM Marek Horst

updating fundingtree value to xml representation and changing expected fundingclass as outcome

39043 04/09/2015 11:26 PM Marek Horst

#1498 introducing major citations related refactoring including new generic direct citation matching moved to processing phase, introduced position field in all citations schemas and updated collapser taking position into account when merging citations details coming from 3 various sources: fuzzy citationmatching, direct citationmatching, references metadata

37947 24/06/2015 12:13 PM Marek Horst

#1212 updating classification test expected results after fixing typo: dccclasses->ddcclasses in taxonomies.db

37780 15/06/2015 12:29 PM Marek Horst

updating expected classes, setting acm classes

37777 15/06/2015 11:14 AM Marek Horst

updating expected classes

37470 26/05/2015 10:30 AM Marek Horst

introducing missing parameters for pdb reference extraction

37414 22/05/2015 05:32 PM Marek Horst

#1315 providing missing confidenceLevel

37408 22/05/2015 02:41 PM Marek Horst

#1315 providing missing confidenceLevel

37392 22/05/2015 01:13 PM Marek Horst

#1315 updating expected jsons in integration test after DocumentToConceptIds schema refactoring

36450 17/04/2015 04:38 PM Marek Horst

upgrading xmlns version to 0.4 in order to support global element

36286 09/04/2015 07:10 PM Marek Horst

#1257 dropping schema generation related hacks in all map-reduce modules, switching to literal schema parameters

35701 27/03/2015 06:18 AM Mateusz Kobos

Removing usage of working_dir from Java workflow node.

35232 11/03/2015 02:19 PM Marek Horst

reenabling document to project reference import validation

35231 11/03/2015 01:55 PM Marek Horst

updating expected documents list

35230 11/03/2015 01:30 PM Marek Horst

temporarily skipping docproject validation

35229 11/03/2015 01:14 PM Marek Horst

#1195 removing obsolete ports docreation and datasetid from hbase mapred import, removing references to those ports in workflow.xml files, updating transformer by removing filtering by datasetid due to decisions made in #1072

35200 09/03/2015 07:07 PM Marek Horst

fixing json escape character by putting \\ in place of \

35199 09/03/2015 06:45 PM Marek Horst

extending mapreduce metadata importer test with validating import of different kinds of relations and dataset identifier

35198 09/03/2015 06:44 PM Marek Horst

extending mapreduce metadata importer test with validating import of different kinds of relations and dataset identifier

35191 09/03/2015 04:52 PM Marek Horst

removing obsolete citations

35189 09/03/2015 04:24 PM Marek Horst

updating confidence level value to 1.0 for record coming from PMC

35187 09/03/2015 04:21 PM Marek Horst

removing obsolete pdf directory

35183 09/03/2015 03:16 PM Marek Horst

adding missing "confidenceLevel" field

35153 06/03/2015 06:25 PM Marek Horst

maintaining pmc citation and testing citations merging process

35152 06/03/2015 05:36 PM Marek Horst

reintroducing multiple citations after introducing sorting in transformer

35149 06/03/2015 05:25 PM Marek Horst

limiting citations count to 1 until results order produced by citation matching module is repeatable

35146 06/03/2015 04:13 PM Marek Horst

including: FET project reference extraction, EGI case, dataset reference extraction outcome validation

35144 06/03/2015 03:06 PM Marek Horst

enabling citation matching algorithm

35143 06/03/2015 03:06 PM Marek Horst

updating expected citations

35122 05/03/2015 06:47 PM Marek Horst

removing comment

35120 05/03/2015 06:46 PM Marek Horst

primary processing integration test major refactoring: dropping cermine execution and providing plaintext and extracted metadata as json records

34876 27/02/2015 04:08 PM Marek Horst

overriding memory parameter due to test cluster memory limitations, setting it to Xmx256m

34875 27/02/2015 03:24 PM Marek Horst

overriding memory parameter due to test cluster memory limitations, setting it to Xmx128m

34871 27/02/2015 02:49 PM Marek Horst

overriding memory parameter due to test cluster memory limitations, setting it to Xmx512m

34869 27/02/2015 02:48 PM Marek Horst

updating expected classes in integration test after recent #720 change and fixing confidence level distribution

34804 25/02/2015 07:19 PM Marek Horst

overriding memory parameter due to test cluster memory limitations

34702 20/02/2015 07:17 PM Marek Horst

#1133 dropping useless working_dir creation for java nodes

33249 09/12/2014 06:41 PM Marek Horst

#919 renaming DocumentToResearchInitiative to DocumentToConceptId and DocumentToResearchInitiatives to DocumentToConceptIds

33218 05/12/2014 04:26 PM Marek Horst

#919 adding missing i/o ports related to FET projects reference extraction

32825 17/11/2014 03:43 PM Marek Horst

introducing separate citations json containing expected results, not enabled in workflow yet

31846 28/10/2014 03:45 PM Marek Horst

fixing citations schema type

31647 22/10/2014 06:31 PM Marek Horst

enabling document classification and research initiatives reference extraction algorithms

31250 09/10/2014 03:33 PM Marek Horst

introducing regex support in result approver to support iis::* kind of provenance, updating workflow definitions with proper regex values

31228 08/10/2014 06:19 PM Marek Horst

#840 moving IdentifierMapping from importer to common package

31222 08/10/2014 06:12 PM Marek Horst

#840 renaming DeduplicationMapping to more generic IdentifierMapping

31034 02/10/2014 02:15 PM Marek Horst

removing extracted_metadata.json which will not be checked anymore

30938 29/09/2014 06:18 PM Marek Horst

skipping extracted_metadata comparison which is cumbersome due to frequent changes and large volume of references

30885 25/09/2014 06:40 PM Marek Horst

introducing newly added address field in json record

30876 25/09/2014 05:03 PM Marek Horst

fixing field names after recent Affiliation.avdl refactoring and adding countryCode field, renaming contry to countryName

29895 28/08/2014 04:32 PM Marek Horst

updating expected output

29854 25/08/2014 06:06 PM Marek Horst

moving ACM importer to icm-iis-mainworkflows due to extending dependencies with cermine, introducing performance tests

29645 29/07/2014 10:43 AM Marek Horst

updating expected record content

29398 21/07/2014 04:21 PM Marek Horst

updating expected extracted metadata

29300 19/07/2014 12:43 AM Mateusz Kobos

Fixing names of parameters accepted by workflow nodes

29017 11/07/2014 10:29 AM Marek Horst

#486 fixing integration test: introducing missing document_text_wos input port for primary/processing

28872 03/07/2014 02:00 PM Marek Horst

updating expected references output for doc=id-3

28817 02/07/2014 02:47 PM Marek Horst

fixing affiliations and positions in authors details

28816 02/07/2014 02:36 PM Marek Horst

fixing HBase model json representation to be compliant with most recent dnet-openaire-data-protos:3.0.0-SNAPSHOT model: complex relation identifiers, dataInfo on fields level etc