added logic for selection criteria implementation on datasources
export of link between publication and software (related to #4593)
print json, but commented out. Added test to get a proto from a json
added resourcetype and resulttype, new work type mapping
code cleaning
code adapted to new version of mapping utils 6.3.25 which supports journal information also on data sources
this mapper handles actions for orcid publications (without doi)
generate actionset for orcid works and unit tests
Avoid nullpointer for publisher
Discard records without a valid author, as requested in #4392, #4393, #4395, #4396. If the record also has at least one valid author, the record is kept and only the invalid authors are removed.
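The filtering rule described in this commit can be sketched roughly as follows. The author representation (a plain name string) and the validity check (non-blank name) are assumptions for illustration only; the actual model and the checks required by #4392-#4396 may differ.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class AuthorCleaner {

    // Assumed validity rule for illustration: an author is valid when
    // it has a non-blank name. The real checks in the tickets may differ.
    static boolean isValidAuthor(String name) {
        return name != null && !name.trim().isEmpty();
    }

    // Returns the cleaned author list, or null when the record has no
    // valid author at all and should therefore be discarded.
    static List<String> cleanAuthors(List<String> authors) {
        if (authors == null) {
            return null;
        }
        List<String> valid = authors.stream()
                .filter(AuthorCleaner::isValidAuthor)
                .collect(Collectors.toList());
        return valid.isEmpty() ? null : valid;
    }

    public static void main(String[] args) {
        // one valid and one invalid author: record kept, invalid author removed
        System.out.println(cleanAuthors(Arrays.asList("A. Author", " ")));
        // no valid author: record discarded (null)
        System.out.println(cleanAuthors(Arrays.asList("", "  ")));
    }
}
```

Returning null as the "discard" signal keeps the sketch short; a real mapper would more likely emit nothing for that record.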
Workaround for #4362: instances from Unpaywall mapped into licenses and then lost
Addressing quality of the research graph: #4368 and #4360.
add provenance to bulktagging
updated label for a counter that I hope I will never see
removed useless comment
refactoring and update of default variable values
remove from context association to Zenodo Community and refactoring of code
propagation of country from institutional repository and of result projects through semantic link
update for zenodo community bulk tagging
fixed DOIBoost Bug
fixed NPE in OrcidEventFactory, improved serialisation of the ORCID in the OpenAIRE event payload
added first implementation for OrcidEventFactory
Implemented ORCID event generation process and related configuration profile. Added workflow to orchestrate the event generation for software links
aligning with MASTER branch
migrated classes for the FixRelation job from MASTER branch
replaced CrossRef with Crossref
logging index feed retry number
branch for solr 7.5.0
import form master branch
avoid importing unnecessary affiliations from DOIBoost
fixed bug for uncompressed abstract in DOIBoostToAction
changed mapping for compressed abstract in DOIBoostToAction
less verbose logging
using proper import for LogFactory
added Mapper and Reducer class for infoSpace counts workflows
added reducer to produce counts on the infospace
cannot use Guava's Splitter.splitToList, must stick to basic split method. Classpath is messed up
implemented ConfigurableExportMapper
Removed unused import
fixed test
updated Mapper to return the whole invalid record
export invalid xml records
refactored Action
Map-only job that produces [openaireId, doi] pairs of records containing invalid characters
added parameter to filter only organizations in DOIBoostToAction
fixed problem of missing author names
merged beta branch to master
introduced use of BlockProcessor
fixed issue when country information is not present for datasource
replaced exception throwing with counters
changed parameter type from ImmutableBytesWritable to Text
refactoring and change of counters
rollback wrong commit
fixing and testing propagation implementation
reducer for country propagation that writes on hdfs
cleanup pid types in order to make them valid attribute names
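A possible reading of this commit, sketched as a standalone helper: pid types used as attribute names must follow name rules (no colons or spaces, no leading digit), so unsafe characters are replaced and a leading digit is escaped. The exact character policy below is an assumption for illustration, not the project's actual one.

```java
public class PidTypeCleaner {

    // Hypothetical sanitizer: keep letters, digits, '_', '-', '.';
    // replace everything else with '_', and prefix values that would
    // start with a digit so the result is a valid attribute name.
    static String cleanPidType(String pidType) {
        String cleaned = pidType.trim().replaceAll("[^a-zA-Z0-9_.\\-]", "_");
        if (cleaned.isEmpty() || Character.isDigit(cleaned.charAt(0))) {
            cleaned = "_" + cleaned;
        }
        return cleaned;
    }

    public static void main(String[] args) {
        System.out.println(cleanPidType("urn:nbn"));  // colon replaced
        System.out.println(cleanPidType("10.1234"));  // leading digit escaped
    }
}
```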
added code for propagation of countries from institutional repositories
master branch for deployments @ICM
why parse strings as Floats?
reverted to r52985. Test runs show we need to rely on the edgeIds produced by the connected components identification phase instead of the vertexIds
alignment to trunk version
avoid producing duplicated events by eliminating the roots from the comparison process
introduced mapping resulttype -> portal url
Fixed log class name
avoid collisions when hashing pids by value
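One way to read this commit: when pids are hashed by value alone, identifiers from different schemes that happen to share the same literal value collide. A sketch of the fix, using a hypothetical key function that mixes the pid type into the hashed material (the real hashing scheme in the repository is not shown here):

```java
public class PidHash {

    // Hypothetical key: hash type and value together so doi "X" and
    // pmid "X" produce different keys even though the values match.
    static String hashKey(String pidType, String pidValue) {
        return Integer.toHexString((pidType + "::" + pidValue).hashCode());
    }

    public static void main(String[] args) {
        System.out.println(hashKey("doi", "10.1/abc"));
        System.out.println(hashKey("pmid", "10.1/abc"));
    }
}
```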
cleaned up unused method, using setDurability in put operation
added mapper and hadoop job configuration file for importing Grid.AC organization data
integrating bulktag from trunk to beta branch
rule out invalid dates also on CrossRefToActions
rule out invalid dates on ScholixToActions
cleanup
produce 'supplement' subrel type in case of supplement relationships
simplified connected component application on the graph
added check to investigate the bug of wrong relations being generated
do not skip processing datasets in DedupBuildRootsMapper, improved error reporting in DedupBuildRootsReducer
do not load vertex ids into memory, process them on the fly
added jobs for predatory journal analysis
added invisible setup
fixed null element
Created CrossrefImportMapper
add CrossRefToAction
fixed mapping from scholix to openaire model
small fixes
changed key type
implemented mapper writing
added configuration
added Mapper to transform scholexplorer links into actionsets
deprecation: use setDurability instead of setWriteToWAL
introduced subType in pace wf configuration
adjusted ids export procedure
avoid emitting enrichment events when the similarity score is below the threshold
javadoc and test
indentation