------------------------------------------------------------------------ r35409 | marek.horst | 2015-03-17 15:04:06 +0100 (Tue, 17 Mar 2015) | 1 line #1198 aligning IIS dependencies and java code to CDH5.3.0 cluster ------------------------------------------------------------------------ r35395 | marek.horst | 2015-03-17 15:01:04 +0100 (Tue, 17 Mar 2015) | 1 line #1197 introducing job.properties changes aligning paths to rumcajs cluster HDFS structure ------------------------------------------------------------------------ r35250 | marek.horst | 2015-03-11 16:48:11 +0100 (Wed, 11 Mar 2015) | 1 line creating IIS-CDH-5.3.0 branch ------------------------------------------------------------------------ r34616 | marek.horst | 2015-02-19 18:12:12 +0100 (Thu, 19 Feb 2015) | 1 line #1038 introducing ranges in dependencies definition for all IIS modules ------------------------------------------------------------------------ r33612 | marek.horst | 2014-12-16 20:53:30 +0100 (Tue, 16 Dec 2014) | 1 line [maven-release-plugin] prepare for next development iteration ------------------------------------------------------------------------ r33610 | marek.horst | 2014-12-16 20:53:26 +0100 (Tue, 16 Dec 2014) | 1 line [maven-release-plugin] prepare release icm-iis-documentssimilarity-1.0.0 ------------------------------------------------------------------------ r33605 | marek.horst | 2014-12-16 20:15:05 +0100 (Tue, 16 Dec 2014) | 1 line #1044 pre-release switching to released version of parent pom and released dependencies ------------------------------------------------------------------------ r33498 | marek.horst | 2014-12-15 19:01:20 +0100 (Mon, 15 Dec 2014) | 1 line #1044 moving coansys placeholder definition to documentssimilarity and citationmatching modules to eliminate necessity of releasing parentcontainer module every time coansys version changes. ------------------------------------------------------------------------ r33411 | marek.horst | 2014-12-15 12:42:38 +0100 (Mon, 15 Dec 2014) | 1 line introducing scm definition ------------------------------------------------------------------------ r33225 | marek.horst | 2014-12-08 14:48:27 +0100 (Mon, 08 Dec 2014) | 1 line #1026 setting threshold_num_of_vector_elems_length to 2 which proves to be solution for mentioned problem ------------------------------------------------------------------------ r33183 | marek.horst | 2014-12-04 16:06:29 +0100 (Thu, 04 Dec 2014) | 1 line #1026 introducing threshold_num_of_vector_elems_length parameter support which eliminate all documents with terms verctor shorter than specified threshold ------------------------------------------------------------------------ r32253 | marek.horst | 2014-11-05 18:42:11 +0100 (Wed, 05 Nov 2014) | 1 line introducing ${iis.coansys.version} placeholder for coansys version, upgrading value to 1.7-SNAPSHOT after todays coansys release ------------------------------------------------------------------------ r32239 | marek.horst | 2014-11-05 17:27:42 +0100 (Wed, 05 Nov 2014) | 1 line introducing embedded integration test entry ------------------------------------------------------------------------ r31036 | marek.horst | 2014-10-02 14:29:51 +0200 (Thu, 02 Oct 2014) | 1 line introducing cloudera repository in parent container, removing repository definitions from individual IIS modules ------------------------------------------------------------------------ r30100 | marek.horst | 2014-09-10 16:34:16 +0200 (Wed, 10 Sep 2014) | 1 line #768 fix: introducing missing mainDirectory parameter set to ${wf:appPath()}/coansys ------------------------------------------------------------------------ r30049 | marek.horst | 2014-09-08 11:39:14 +0200 (Mon, 08 Sep 2014) | 1 line updating job.properties ------------------------------------------------------------------------ r28765 | marek.horst | 2014-07-01 17:02:31 +0200 (Tue, 01 Jul 2014) | 1 line introducing deploy.info file for module icm-iis-documentssimilarity ------------------------------------------------------------------------ r28742 | marek.horst | 2014-07-01 14:35:30 +0200 (Tue, 01 Jul 2014) | 1 line moving icm-iis-* modules from dnet11 to dnet40 ------------------------------------------------------------------------ r27993 | marek.horst | 2014-06-05 14:01:47 +0200 (Thu, 05 Jun 2014) | 8 lines updating default similarity properties to: sample=1 tfidfTopnTermPerDocument=20 removal_least_used=20 removal_rate=0.99 similarityTopnDocumentPerDocument=20 mapredChildJavaOpts=-Xmx20g parallel=20 ------------------------------------------------------------------------ r27911 | marek.horst | 2014-06-03 10:33:21 +0200 (Tue, 03 Jun 2014) | 1 line updating default job.properties ------------------------------------------------------------------------ r27910 | marek.horst | 2014-06-03 10:31:57 +0200 (Tue, 03 Jun 2014) | 1 line setting remove_sideproducts=true by default ------------------------------------------------------------------------ r27908 | marek.horst | 2014-06-03 10:04:34 +0200 (Tue, 03 Jun 2014) | 1 line setting serialize_to_proto default value ------------------------------------------------------------------------ r27906 | marek.horst | 2014-06-03 09:54:20 +0200 (Tue, 03 Jun 2014) | 1 line updating default workflow.xml properties ------------------------------------------------------------------------ r27550 | marek.horst | 2014-05-16 14:32:19 +0200 (Fri, 16 May 2014) | 1 line introducing most recent version of document similarity workflow with updated set of parameters ------------------------------------------------------------------------ r27412 | marek.horst | 2014-05-14 10:09:29 +0200 (Wed, 14 May 2014) | 1 line updating default job.properties ------------------------------------------------------------------------ r27258 | marek.horst | 2014-05-09 11:28:44 +0200 (Fri, 09 May 2014) | 1 line updating converter input path after upgrading doc-sim version ------------------------------------------------------------------------ r27256 | marek.horst | 2014-05-08 22:55:53 +0200 (Thu, 08 May 2014) | 1 line switching to the latest version of coansys document similarity module ------------------------------------------------------------------------ r26568 | marek.horst | 2014-04-11 19:25:32 +0200 (Fri, 11 Apr 2014) | 1 line #332 workflow definitions cleanup. 2.4) prefixing documentssimilarity input/output port names ------------------------------------------------------------------------ r26518 | marek.horst | 2014-04-11 01:13:07 +0200 (Fri, 11 Apr 2014) | 1 line #352 replacing fixed version value 1.7.4 with iis.avro.version placeholder defined in parent pom ------------------------------------------------------------------------ r26489 | marek.horst | 2014-04-10 19:20:42 +0200 (Thu, 10 Apr 2014) | 1 line #349 make all IIS modules dnet-spring4 compliant: updating all pom.xml definitions with propert parent and updated dnet-spring4 SNAPSHOT dependencies. Updating java code by replacing IMDStoreService API with newly introduced MDStoreService API ------------------------------------------------------------------------ r26475 | marek.horst | 2014-04-10 18:51:41 +0200 (Thu, 10 Apr 2014) | 1 line updating job properties ------------------------------------------------------------------------ r26415 | marek.horst | 2014-04-08 11:28:38 +0200 (Tue, 08 Apr 2014) | 1 line updating ds_parallel to 30 to match openaire cluster configuration ------------------------------------------------------------------------ r26160 | marek.horst | 2014-03-27 18:09:46 +0100 (Thu, 27 Mar 2014) | 1 line updating default document similarity parameters ------------------------------------------------------------------------ r25986 | marek.horst | 2014-03-18 11:54:44 +0100 (Tue, 18 Mar 2014) | 1 line parameterizing ds_mapredChildJavaOpts and ds_sample ------------------------------------------------------------------------ r24606 | marek.horst | 2014-02-03 17:41:38 +0100 (Mon, 03 Feb 2014) | 1 line renaming pig_parallel parameter to ds_parallel ------------------------------------------------------------------------ r24603 | marek.horst | 2014-02-03 17:34:14 +0100 (Mon, 03 Feb 2014) | 1 line updating default similarity values ------------------------------------------------------------------------ r24599 | marek.horst | 2014-02-03 17:22:22 +0100 (Mon, 03 Feb 2014) | 1 line setting pig_parallel=40 ------------------------------------------------------------------------ r24578 | marek.horst | 2014-02-03 13:52:25 +0100 (Mon, 03 Feb 2014) | 1 line upgrading coansys similarity module from document-similarity-workflow to document-similarity-ranked-workflow ------------------------------------------------------------------------ r23998 | marek.horst | 2014-01-10 15:55:18 +0100 (Fri, 10 Jan 2014) | 1 line changing default ds_tfidfMinValue from 0.4 to 0.6 to limit results ------------------------------------------------------------------------ r23997 | marek.horst | 2014-01-10 15:54:47 +0100 (Fri, 10 Jan 2014) | 1 line updating default job properties ------------------------------------------------------------------------ r23961 | marek.horst | 2014-01-08 17:25:22 +0100 (Wed, 08 Jan 2014) | 2 lines handling similarityTopnDocumentPerDocument and tfidfTopnTermPerDocument doc-sim parameters provided at runtime ------------------------------------------------------------------------ r23898 | marek.horst | 2014-01-02 14:35:10 +0100 (Thu, 02 Jan 2014) | 1 line parameterizing ds_tfidfMinValue ------------------------------------------------------------------------ r23447 | marek.horst | 2013-12-16 19:22:41 +0100 (Mon, 16 Dec 2013) | 1 line updating default datastores in job properties ------------------------------------------------------------------------ r22901 | mateusz.fedoryszak | 2013-12-09 16:30:19 +0100 (Mon, 09 Dec 2013) | 1 line properties ------------------------------------------------------------------------ r22899 | mateusz.fedoryszak | 2013-12-09 16:28:39 +0100 (Mon, 09 Dec 2013) | 1 line new CoAnSys version ------------------------------------------------------------------------ r22794 | mateusz.fedoryszak | 2013-12-06 12:21:29 +0100 (Fri, 06 Dec 2013) | 1 line renaming io parameters ------------------------------------------------------------------------ r22632 | mateusz.fedoryszak | 2013-11-29 18:24:59 +0100 (Fri, 29 Nov 2013) | 1 line new format of input data ------------------------------------------------------------------------ r22570 | mateusz.fedoryszak | 2013-11-29 14:41:45 +0100 (Fri, 29 Nov 2013) | 1 line MiniOozie support ------------------------------------------------------------------------ r22568 | mateusz.fedoryszak | 2013-11-29 14:39:55 +0100 (Fri, 29 Nov 2013) | 1 line Pig parallel param ------------------------------------------------------------------------ r22556 | mateusz.fedoryszak | 2013-11-29 11:51:26 +0100 (Fri, 29 Nov 2013) | 1 line fixes ------------------------------------------------------------------------ r22555 | mateusz.fedoryszak | 2013-11-29 11:50:59 +0100 (Fri, 29 Nov 2013) | 1 line removing unnecessary lines ------------------------------------------------------------------------ r22445 | mateusz.fedoryszak | 2013-11-26 14:57:12 +0100 (Tue, 26 Nov 2013) | 1 line Moving generic converter to common ------------------------------------------------------------------------ r22420 | mateusz.fedoryszak | 2013-11-25 13:08:50 +0100 (Mon, 25 Nov 2013) | 1 line fixing property misuse ------------------------------------------------------------------------ r22410 | mateusz.fedoryszak | 2013-11-25 11:37:52 +0100 (Mon, 25 Nov 2013) | 1 line missing brackets ------------------------------------------------------------------------ r22409 | mateusz.fedoryszak | 2013-11-25 11:37:10 +0100 (Mon, 25 Nov 2013) | 1 line Generic converting mapper ------------------------------------------------------------------------ r22229 | mateusz.fedoryszak | 2013-11-18 11:07:35 +0100 (Mon, 18 Nov 2013) | 1 line somewhat works (no errors nor output) ------------------------------------------------------------------------ r21965 | mateusz.fedoryszak | 2013-11-13 10:04:18 +0100 (Wed, 13 Nov 2013) | 1 line basic converters ------------------------------------------------------------------------ r21733 | marek.horst | 2013-11-04 10:49:31 +0100 (Mon, 04 Nov 2013) | 1 line introducing "icm-iis-documentssimilarity" ------------------------------------------------------------------------ r21730 | marek.horst | 2013-11-04 10:47:29 +0100 (Mon, 04 Nov 2013) | 1 line Share project "icm-iis-documentssimilarity" into "https://svn.driver.research-infrastructures.eu/driver" ------------------------------------------------------------------------