# Date Author Comment
50020 29/11/2017 02:24 PM Marek Horst

updating IIS cache builder profile by removing obsolete properties which are currently defined in config-default.xml file on IIS cluster

48879 05/09/2017 03:45 PM Claudio Atzori
47622 22/06/2017 02:53 PM Claudio Atzori

import_hbase_dump_location parameter must be passed by the wf

45966 21/02/2017 12:52 PM Marek Horst

dropping _default part from cache location parameters

45961 20/02/2017 04:01 PM Alessia Bardi

updated iis cache builder library version

45640 31/01/2017 11:24 AM Claudio Atzori

added LOD export configuration job profile

45044 29/12/2016 05:02 PM Marek Horst

introducing ingest_pmc_default_cache_location parameter required by newly introduced pmc ingestion caching mechanism

44563 17/11/2016 11:29 AM Marek Horst

updating metadataextraction cache location

44385 03/11/2016 06:26 PM Marek Horst

removing obsolete properties which are controlled on IIS side in default-config.xml file, more details in #2177#note-8

44086 14/10/2016 03:40 PM Alessia Bardi

added 'reports_external_path' property as indicated by Marek in #1356#note-19

43737 19/09/2016 05:34 PM Claudio Atzori

updated iis V2 workflow definitions and related action set profiles

43639 13/09/2016 05:19 PM Claudio Atzori

updated IIS CDH5 specific job profiles

43638 13/09/2016 05:13 PM Claudio Atzori

updated IIS CDH5 specific job profiles

43613 09/09/2016 09:52 AM Claudio Atzori

refinements in the cache builder workflow

43600 07/09/2016 05:34 PM Claudio Atzori

updated action set profiles, introduced iisCacheBuilderJob, CDH5 specific inference Jobs

43597 07/09/2016 05:12 PM Alessia Bardi

datasource with type "websource" will end up with typeui "other"

43430 25/07/2016 04:31 PM Alessia Bardi

#2192: I can do it...job properties have key, not name

43429 25/07/2016 04:29 PM Alessia Bardi

#2192: fixed profile and more logs

43428 25/07/2016 04:21 PM Alessia Bardi

#2192: entityregistry::* should end up with "other" datasource type label for the portal. PrepareReduceFeeder now expects a 'ui.other.datasourcetypes' job param with the list of datasource types to be handled this way.

43187 12/07/2016 10:33 AM Claudio Atzori

[broker] hadoop job profile

42424 03/05/2016 11:17 AM Claudio Atzori

removed empty SCAN element

42409 02/05/2016 04:29 PM Claudio Atzori

added more jobs

42365 02/05/2016 02:31 PM Claudio Atzori

fixed extra char

42364 02/05/2016 02:30 PM Claudio Atzori

deleteSimRelJob updated to deleteDedupRelsJob

42363 02/05/2016 02:29 PM Claudio Atzori

added new M/R Jobs

42295 22/04/2016 05:09 PM Claudio Atzori

making schema validation happy

42292 22/04/2016 05:01 PM Claudio Atzori

making schema validation happy

42246 15/04/2016 06:04 PM Claudio Atzori

introduced HDFS Action related job profiles

41761 18/03/2016 06:05 PM Claudio Atzori

added anchorStats job

41071 27/01/2016 07:00 PM Claudio Atzori

added hadoop job profile for the openaire identifiers export workflow

40551 18/12/2015 05:37 PM Alessia Bardi

document similarity threshold set to 0.7 instead of 0.8.

40510 17/12/2015 11:46 AM Alessia Bardi

#1772: changed default trust thresholds

40246 04/12/2015 05:53 PM Claudio Atzori

using new metadata cache location

40245 04/12/2015 05:52 PM Claudio Atzori

introduced new hadoop job profiles (dedup)

39623 19/10/2015 11:35 AM Michele Artini

use of external properties

39567 14/10/2015 11:58 AM Michele Artini

use of Text instead of ImmutableBytesWritable

39562 13/10/2015 04:56 PM Michele Artini

reimplemented calculatePersonDistribution M/R job to consider only the results from pubsrepositories (not journals)

39556 13/10/2015 12:55 PM Claudio Atzori

added default threshold parameters. #1209

39298 18/09/2015 04:25 PM Claudio Atzori

informationSpaceImportJob

39296 18/09/2015 03:35 PM Claudio Atzori

updated compression parameters

39268 17/09/2015 04:39 PM Claudio Atzori

compressing output

39222 14/09/2015 06:06 PM Claudio Atzori

added information space export job

38368 20/07/2015 02:49 PM Claudio Atzori

added hadoop jobs (dedup person)

38059 06/07/2015 12:06 PM Claudio Atzori

MapDocument implements a more general view of the pace model

38043 02/07/2015 10:31 AM Alessia Bardi

added trust level threshold for document similarity and document classes

38042 01/07/2015 07:32 PM Alessia Bardi

new parameter for pdb inference module

37852 18/06/2015 05:13 PM Claudio Atzori

added coauthor workflow and hadoop job

37779 15/06/2015 11:41 AM Michele Artini

profiles to run calculate Person Distribution

37776 15/06/2015 10:25 AM Claudio Atzori

updated job props

37751 12/06/2015 04:05 PM Claudio Atzori

added workflow to export the representative publications as json on hdfs

37649 05/06/2015 06:51 PM Claudio Atzori

updated primary iis job profile and workflow to the latest specs

36726 27/04/2015 11:14 AM Claudio Atzori

merged branch dedupConf

35778 30/03/2015 12:13 PM Marek Horst

#953 blacklisting da458477233b5561ae47042aa2a73086 content

35756 28/03/2015 11:33 AM Marek Horst

#953 adding bea4728578070c3d66774bf9454d41fe checksum to the blacklist

34881 27/02/2015 04:25 PM Claudio Atzori

attempt to define custom user names #1153

34553 18/02/2015 10:55 AM Claudio Atzori

including merge relationship in duplicate scan phase

34388 10/02/2015 11:21 AM Alessia Bardi

wf and hadoop job updates to support the exclusion of persons and duplicate records during the OAI feed.

33679 18/12/2014 06:13 PM Claudio Atzori

indentation

33240 09/12/2014 03:18 PM Claudio Atzori

added scheduler pool name

32862 18/11/2014 03:48 PM Claudio Atzori

updated profiles

32841 17/11/2014 06:35 PM Claudio Atzori

stats conf moved to the respective HadoopJobConfiguration profile.

32285 06/11/2014 08:22 PM Marek Horst

updating metadataextraction_excluded_checksums to 1e5b574109da731f4918c7f91fc24864 value

32178 04/11/2014 07:44 PM Claudio Atzori

updated job profile

32168 04/11/2014 02:07 PM Marek Horst

setting metadataextraction_excluded_checksums to $UNDEFINED$ which means no documents should be excluded

32149 04/11/2014 11:08 AM Claudio Atzori

updated copytable job definition

31988 30/10/2014 07:56 PM Marek Horst

renaming input parameter: metadataextraction_excluded_ids -> metadataextraction_excluded_checksums

31214 08/10/2014 05:44 PM Claudio Atzori

added copytable job profile

31159 06/10/2014 05:12 PM Claudio Atzori

added scanner caching

30040 05/09/2014 06:07 PM Claudio Atzori

updated required parameters

29989 03/09/2014 07:05 PM Claudio Atzori

added flags to enable/disable metadata extraction module

29972 03/09/2014 12:00 PM Claudio Atzori

added action sets dedicated to each inference module

29966 03/09/2014 10:46 AM Claudio Atzori

updated jobs specs

29578 25/07/2014 05:28 PM Claudio Atzori

Context profiles will be fetched by the oozie process, so we pass the isLookupEndpoint as wf param

29396 21/07/2014 03:54 PM Claudio Atzori

updated job definition

29388 21/07/2014 09:33 AM Claudio Atzori

added stats update job profile

28994 10/07/2014 05:06 PM Claudio Atzori

submittable M/R OAI feeding job

28061 06/06/2014 05:55 PM Claudio Atzori

updated dedup/indexing configuration and the related job definitions

27568 16/05/2014 05:04 PM Claudio Atzori

updated IIS job interfaces

27525 15/05/2014 06:46 PM Claudio Atzori

improved parameters management

27200 06/05/2014 06:06 PM Claudio Atzori

map only job configuration used to feed the oai store

26981 18/04/2014 06:21 PM Claudio Atzori

updated iisMain workflow configuration profile