1 |
37141
|
marek.hors
|
------------------------------------------------------------------------
|
2 |
|
|
r35409 | marek.horst | 2015-03-17 15:04:06 +0100 (Tue, 17 Mar 2015) | 1 line
|
3 |
|
|
|
4 |
|
|
#1198 aligning IIS dependencies and java code to CDH5.3.0 cluster
|
5 |
|
|
------------------------------------------------------------------------
|
6 |
|
|
r35395 | marek.horst | 2015-03-17 15:01:04 +0100 (Tue, 17 Mar 2015) | 1 line
|
7 |
|
|
|
8 |
|
|
#1197 introducing job.properties changes aligning paths to rumcajs cluster HDFS structure
|
9 |
|
|
------------------------------------------------------------------------
|
10 |
|
|
r35250 | marek.horst | 2015-03-11 16:48:11 +0100 (Wed, 11 Mar 2015) | 1 line
|
11 |
|
|
|
12 |
|
|
creating IIS-CDH-5.3.0 branch
|
13 |
|
|
------------------------------------------------------------------------
|
14 |
|
|
r34616 | marek.horst | 2015-02-19 18:12:12 +0100 (Thu, 19 Feb 2015) | 1 line
|
15 |
|
|
|
16 |
|
|
#1038 introducing ranges in dependencies definition for all IIS modules
|
17 |
|
|
------------------------------------------------------------------------
|
18 |
|
|
r33612 | marek.horst | 2014-12-16 20:53:30 +0100 (Tue, 16 Dec 2014) | 1 line
|
19 |
|
|
|
20 |
|
|
[maven-release-plugin] prepare for next development iteration
|
21 |
|
|
------------------------------------------------------------------------
|
22 |
|
|
r33610 | marek.horst | 2014-12-16 20:53:26 +0100 (Tue, 16 Dec 2014) | 1 line
|
23 |
|
|
|
24 |
|
|
[maven-release-plugin] prepare release icm-iis-documentssimilarity-1.0.0
|
25 |
|
|
------------------------------------------------------------------------
|
26 |
|
|
r33605 | marek.horst | 2014-12-16 20:15:05 +0100 (Tue, 16 Dec 2014) | 1 line
|
27 |
|
|
|
28 |
|
|
#1044 pre-release switching to released version of parent pom and released dependencies
|
29 |
|
|
------------------------------------------------------------------------
|
30 |
|
|
r33498 | marek.horst | 2014-12-15 19:01:20 +0100 (Mon, 15 Dec 2014) | 1 line
|
31 |
|
|
|
32 |
|
|
#1044 moving coansys placeholder definition to documentssimilarity and citationmatching modules to eliminate necessity of releasing parentcontainer module every time coansys version changes.
|
33 |
|
|
------------------------------------------------------------------------
|
34 |
|
|
r33411 | marek.horst | 2014-12-15 12:42:38 +0100 (Mon, 15 Dec 2014) | 1 line
|
35 |
|
|
|
36 |
|
|
introducing scm definition
|
37 |
|
|
------------------------------------------------------------------------
|
38 |
|
|
r33225 | marek.horst | 2014-12-08 14:48:27 +0100 (Mon, 08 Dec 2014) | 1 line
|
39 |
|
|
|
40 |
|
|
#1026 setting threshold_num_of_vector_elems_length to 2 which proves to be solution for mentioned problem
|
41 |
|
|
------------------------------------------------------------------------
|
42 |
|
|
r33183 | marek.horst | 2014-12-04 16:06:29 +0100 (Thu, 04 Dec 2014) | 1 line
|
43 |
|
|
|
44 |
|
|
#1026 introducing threshold_num_of_vector_elems_length parameter support which eliminate all documents with terms verctor shorter than specified threshold
|
45 |
|
|
------------------------------------------------------------------------
|
46 |
|
|
r32253 | marek.horst | 2014-11-05 18:42:11 +0100 (Wed, 05 Nov 2014) | 1 line
|
47 |
|
|
|
48 |
|
|
introducing ${iis.coansys.version} placeholder for coansys version, upgrading value to 1.7-SNAPSHOT after todays coansys release
|
49 |
|
|
------------------------------------------------------------------------
|
50 |
|
|
r32239 | marek.horst | 2014-11-05 17:27:42 +0100 (Wed, 05 Nov 2014) | 1 line
|
51 |
|
|
|
52 |
|
|
introducing embedded integration test entry
|
53 |
|
|
------------------------------------------------------------------------
|
54 |
|
|
r31036 | marek.horst | 2014-10-02 14:29:51 +0200 (Thu, 02 Oct 2014) | 1 line
|
55 |
|
|
|
56 |
|
|
introducing cloudera repository in parent container, removing repository definitions from individual IIS modules
|
57 |
|
|
------------------------------------------------------------------------
|
58 |
|
|
r30100 | marek.horst | 2014-09-10 16:34:16 +0200 (Wed, 10 Sep 2014) | 1 line
|
59 |
|
|
|
60 |
|
|
#768 fix: introducing missing mainDirectory parameter set to ${wf:appPath()}/coansys
|
61 |
|
|
------------------------------------------------------------------------
|
62 |
|
|
r30049 | marek.horst | 2014-09-08 11:39:14 +0200 (Mon, 08 Sep 2014) | 1 line
|
63 |
|
|
|
64 |
|
|
updating job.properties
|
65 |
|
|
------------------------------------------------------------------------
|
66 |
|
|
r28765 | marek.horst | 2014-07-01 17:02:31 +0200 (Tue, 01 Jul 2014) | 1 line
|
67 |
|
|
|
68 |
|
|
introducing deploy.info file for module icm-iis-documentssimilarity
|
69 |
|
|
------------------------------------------------------------------------
|
70 |
|
|
r28742 | marek.horst | 2014-07-01 14:35:30 +0200 (Tue, 01 Jul 2014) | 1 line
|
71 |
|
|
|
72 |
|
|
moving icm-iis-* modules from dnet11 to dnet40
|
73 |
|
|
------------------------------------------------------------------------
|
74 |
|
|
r27993 | marek.horst | 2014-06-05 14:01:47 +0200 (Thu, 05 Jun 2014) | 8 lines
|
75 |
|
|
|
76 |
|
|
updating default similarity properties to:
|
77 |
|
|
sample=1
|
78 |
|
|
tfidfTopnTermPerDocument=20
|
79 |
|
|
removal_least_used=20
|
80 |
|
|
removal_rate=0.99
|
81 |
|
|
similarityTopnDocumentPerDocument=20
|
82 |
|
|
mapredChildJavaOpts=-Xmx20g
|
83 |
|
|
parallel=20
|
84 |
|
|
------------------------------------------------------------------------
|
85 |
|
|
r27911 | marek.horst | 2014-06-03 10:33:21 +0200 (Tue, 03 Jun 2014) | 1 line
|
86 |
|
|
|
87 |
|
|
updating default job.properties
|
88 |
|
|
------------------------------------------------------------------------
|
89 |
|
|
r27910 | marek.horst | 2014-06-03 10:31:57 +0200 (Tue, 03 Jun 2014) | 1 line
|
90 |
|
|
|
91 |
|
|
setting remove_sideproducts=true by default
|
92 |
|
|
------------------------------------------------------------------------
|
93 |
|
|
r27908 | marek.horst | 2014-06-03 10:04:34 +0200 (Tue, 03 Jun 2014) | 1 line
|
94 |
|
|
|
95 |
|
|
setting serialize_to_proto default value
|
96 |
|
|
------------------------------------------------------------------------
|
97 |
|
|
r27906 | marek.horst | 2014-06-03 09:54:20 +0200 (Tue, 03 Jun 2014) | 1 line
|
98 |
|
|
|
99 |
|
|
updating default workflow.xml properties
|
100 |
|
|
------------------------------------------------------------------------
|
101 |
|
|
r27550 | marek.horst | 2014-05-16 14:32:19 +0200 (Fri, 16 May 2014) | 1 line
|
102 |
|
|
|
103 |
|
|
introducing most recent version of document similarity workflow with updated set of parameters
|
104 |
|
|
------------------------------------------------------------------------
|
105 |
|
|
r27412 | marek.horst | 2014-05-14 10:09:29 +0200 (Wed, 14 May 2014) | 1 line
|
106 |
|
|
|
107 |
|
|
updating default job.properties
|
108 |
|
|
------------------------------------------------------------------------
|
109 |
|
|
r27258 | marek.horst | 2014-05-09 11:28:44 +0200 (Fri, 09 May 2014) | 1 line
|
110 |
|
|
|
111 |
|
|
updating converter input path after upgrading doc-sim version
|
112 |
|
|
------------------------------------------------------------------------
|
113 |
|
|
r27256 | marek.horst | 2014-05-08 22:55:53 +0200 (Thu, 08 May 2014) | 1 line
|
114 |
|
|
|
115 |
|
|
switching to the latest version of coansys document similarity module
|
116 |
|
|
------------------------------------------------------------------------
|
117 |
|
|
r26568 | marek.horst | 2014-04-11 19:25:32 +0200 (Fri, 11 Apr 2014) | 1 line
|
118 |
|
|
|
119 |
|
|
#332 workflow definitions cleanup. 2.4) prefixing documentssimilarity input/output port names
|
120 |
|
|
------------------------------------------------------------------------
|
121 |
|
|
r26518 | marek.horst | 2014-04-11 01:13:07 +0200 (Fri, 11 Apr 2014) | 1 line
|
122 |
|
|
|
123 |
|
|
#352 replacing fixed version value 1.7.4 with iis.avro.version placeholder defined in parent pom
|
124 |
|
|
------------------------------------------------------------------------
|
125 |
|
|
r26489 | marek.horst | 2014-04-10 19:20:42 +0200 (Thu, 10 Apr 2014) | 1 line
|
126 |
|
|
|
127 |
|
|
#349 make all IIS modules dnet-spring4 compliant: updating all pom.xml definitions with propert parent and updated dnet-spring4 SNAPSHOT dependencies. Updating java code by replacing IMDStoreService API with newly introduced MDStoreService API
|
128 |
|
|
------------------------------------------------------------------------
|
129 |
|
|
r26475 | marek.horst | 2014-04-10 18:51:41 +0200 (Thu, 10 Apr 2014) | 1 line
|
130 |
|
|
|
131 |
|
|
updating job properties
|
132 |
|
|
------------------------------------------------------------------------
|
133 |
|
|
r26415 | marek.horst | 2014-04-08 11:28:38 +0200 (Tue, 08 Apr 2014) | 1 line
|
134 |
|
|
|
135 |
|
|
updating ds_parallel to 30 to match openaire cluster configuration
|
136 |
|
|
------------------------------------------------------------------------
|
137 |
|
|
r26160 | marek.horst | 2014-03-27 18:09:46 +0100 (Thu, 27 Mar 2014) | 1 line
|
138 |
|
|
|
139 |
|
|
updating default document similarity parameters
|
140 |
|
|
------------------------------------------------------------------------
|
141 |
|
|
r25986 | marek.horst | 2014-03-18 11:54:44 +0100 (Tue, 18 Mar 2014) | 1 line
|
142 |
|
|
|
143 |
|
|
parameterizing ds_mapredChildJavaOpts and ds_sample
|
144 |
|
|
------------------------------------------------------------------------
|
145 |
|
|
r24606 | marek.horst | 2014-02-03 17:41:38 +0100 (Mon, 03 Feb 2014) | 1 line
|
146 |
|
|
|
147 |
|
|
renaming pig_parallel parameter to ds_parallel
|
148 |
|
|
------------------------------------------------------------------------
|
149 |
|
|
r24603 | marek.horst | 2014-02-03 17:34:14 +0100 (Mon, 03 Feb 2014) | 1 line
|
150 |
|
|
|
151 |
|
|
updating default similarity values
|
152 |
|
|
------------------------------------------------------------------------
|
153 |
|
|
r24599 | marek.horst | 2014-02-03 17:22:22 +0100 (Mon, 03 Feb 2014) | 1 line
|
154 |
|
|
|
155 |
|
|
setting pig_parallel=40
|
156 |
|
|
------------------------------------------------------------------------
|
157 |
|
|
r24578 | marek.horst | 2014-02-03 13:52:25 +0100 (Mon, 03 Feb 2014) | 1 line
|
158 |
|
|
|
159 |
|
|
upgrading coansys similarity module from document-similarity-workflow to document-similarity-ranked-workflow
|
160 |
|
|
------------------------------------------------------------------------
|
161 |
|
|
r23998 | marek.horst | 2014-01-10 15:55:18 +0100 (Fri, 10 Jan 2014) | 1 line
|
162 |
|
|
|
163 |
|
|
changing default ds_tfidfMinValue from 0.4 to 0.6 to limit results
|
164 |
|
|
------------------------------------------------------------------------
|
165 |
|
|
r23997 | marek.horst | 2014-01-10 15:54:47 +0100 (Fri, 10 Jan 2014) | 1 line
|
166 |
|
|
|
167 |
|
|
updating default job properties
|
168 |
|
|
------------------------------------------------------------------------
|
169 |
|
|
r23961 | marek.horst | 2014-01-08 17:25:22 +0100 (Wed, 08 Jan 2014) | 2 lines
|
170 |
|
|
|
171 |
|
|
handling similarityTopnDocumentPerDocument and tfidfTopnTermPerDocument doc-sim parameters provided at runtime
|
172 |
|
|
|
173 |
|
|
------------------------------------------------------------------------
|
174 |
|
|
r23898 | marek.horst | 2014-01-02 14:35:10 +0100 (Thu, 02 Jan 2014) | 1 line
|
175 |
|
|
|
176 |
|
|
parameterizing ds_tfidfMinValue
|
177 |
|
|
------------------------------------------------------------------------
|
178 |
|
|
r23447 | marek.horst | 2013-12-16 19:22:41 +0100 (Mon, 16 Dec 2013) | 1 line
|
179 |
|
|
|
180 |
|
|
updating default datastores in job properties
|
181 |
|
|
------------------------------------------------------------------------
|
182 |
|
|
r22901 | mateusz.fedoryszak | 2013-12-09 16:30:19 +0100 (Mon, 09 Dec 2013) | 1 line
|
183 |
|
|
|
184 |
|
|
properties
|
185 |
|
|
------------------------------------------------------------------------
|
186 |
|
|
r22899 | mateusz.fedoryszak | 2013-12-09 16:28:39 +0100 (Mon, 09 Dec 2013) | 1 line
|
187 |
|
|
|
188 |
|
|
new CoAnSys version
|
189 |
|
|
------------------------------------------------------------------------
|
190 |
|
|
r22794 | mateusz.fedoryszak | 2013-12-06 12:21:29 +0100 (Fri, 06 Dec 2013) | 1 line
|
191 |
|
|
|
192 |
|
|
renaming io parameters
|
193 |
|
|
------------------------------------------------------------------------
|
194 |
|
|
r22632 | mateusz.fedoryszak | 2013-11-29 18:24:59 +0100 (Fri, 29 Nov 2013) | 1 line
|
195 |
|
|
|
196 |
|
|
new format of input data
|
197 |
|
|
------------------------------------------------------------------------
|
198 |
|
|
r22570 | mateusz.fedoryszak | 2013-11-29 14:41:45 +0100 (Fri, 29 Nov 2013) | 1 line
|
199 |
|
|
|
200 |
|
|
MiniOozie support
|
201 |
|
|
------------------------------------------------------------------------
|
202 |
|
|
r22568 | mateusz.fedoryszak | 2013-11-29 14:39:55 +0100 (Fri, 29 Nov 2013) | 1 line
|
203 |
|
|
|
204 |
|
|
Pig parallel param
|
205 |
|
|
------------------------------------------------------------------------
|
206 |
|
|
r22556 | mateusz.fedoryszak | 2013-11-29 11:51:26 +0100 (Fri, 29 Nov 2013) | 1 line
|
207 |
|
|
|
208 |
|
|
fixes
|
209 |
|
|
------------------------------------------------------------------------
|
210 |
|
|
r22555 | mateusz.fedoryszak | 2013-11-29 11:50:59 +0100 (Fri, 29 Nov 2013) | 1 line
|
211 |
|
|
|
212 |
|
|
removing unnecessary lines
|
213 |
|
|
------------------------------------------------------------------------
|
214 |
|
|
r22445 | mateusz.fedoryszak | 2013-11-26 14:57:12 +0100 (Tue, 26 Nov 2013) | 1 line
|
215 |
|
|
|
216 |
|
|
Moving generic converter to common
|
217 |
|
|
------------------------------------------------------------------------
|
218 |
|
|
r22420 | mateusz.fedoryszak | 2013-11-25 13:08:50 +0100 (Mon, 25 Nov 2013) | 1 line
|
219 |
|
|
|
220 |
|
|
fixing property misuse
|
221 |
|
|
------------------------------------------------------------------------
|
222 |
|
|
r22410 | mateusz.fedoryszak | 2013-11-25 11:37:52 +0100 (Mon, 25 Nov 2013) | 1 line
|
223 |
|
|
|
224 |
|
|
missing brackets
|
225 |
|
|
------------------------------------------------------------------------
|
226 |
|
|
r22409 | mateusz.fedoryszak | 2013-11-25 11:37:10 +0100 (Mon, 25 Nov 2013) | 1 line
|
227 |
|
|
|
228 |
|
|
Generic converting mapper
|
229 |
|
|
------------------------------------------------------------------------
|
230 |
|
|
r22229 | mateusz.fedoryszak | 2013-11-18 11:07:35 +0100 (Mon, 18 Nov 2013) | 1 line
|
231 |
|
|
|
232 |
|
|
somewhat works (no errors nor output)
|
233 |
|
|
------------------------------------------------------------------------
|
234 |
|
|
r21965 | mateusz.fedoryszak | 2013-11-13 10:04:18 +0100 (Wed, 13 Nov 2013) | 1 line
|
235 |
|
|
|
236 |
|
|
basic converters
|
237 |
|
|
------------------------------------------------------------------------
|
238 |
|
|
r21733 | marek.horst | 2013-11-04 10:49:31 +0100 (Mon, 04 Nov 2013) | 1 line
|
239 |
|
|
|
240 |
|
|
introducing "icm-iis-documentssimilarity"
|
241 |
|
|
------------------------------------------------------------------------
|
242 |
|
|
r21730 | marek.horst | 2013-11-04 10:47:29 +0100 (Mon, 04 Nov 2013) | 1 line
|
243 |
|
|
|
244 |
|
|
Share project "icm-iis-documentssimilarity" into "https://svn.driver.research-infrastructures.eu/driver"
|
245 |
|
|
------------------------------------------------------------------------
|