nohup: ignoring input
Nov 26, 2014 5:13:37 PM de.mannheim.uni.searchjoin.SearchJoin searchJoinForTable INFO: Start search join
Nov 26, 2014 5:13:38 PM de.mannheim.uni.IO.ConvertFileToTable readTable INFO: Start to read table senators/p8lg8v2i48qv6djo9gu0g7h-senators.csv
14/11/26 17:13:38 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:13:38 CET 2014: 0 of 0 tasks completed (3/3 active threads).
Nov 26, 2014 5:13:38 PM de.mannheim.uni.TableProcessor.TableKeyIdentifier identifyKeysNaive INFO: The key column is senators with uniqueness of 1.0
Nov 26, 2014 5:13:38 PM de.mannheim.uni.TableProcessor.TableKeyIdentifier identifyKeysNaive INFO: Time for single key identification: 0.002
Nov 26, 2014 5:13:38 PM de.mannheim.uni.IO.ConvertFileToTable readTable INFO: Time reading the table senators/p8lg8v2i48qv6djo9gu0g7h-senators.csv: 0.214
Nov 26, 2014 5:13:38 PM de.mannheim.uni.searchjoin.SearchJoin findJoinsForColumnFast INFO: Searching index ...
14/11/26 17:13:38 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:13:38 CET 2014: 0 of 3 tasks completed (2/2 active threads).
14/11/26 17:13:38 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:13:38 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:13:38 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:00:00.008 so far)
14/11/26 17:13:48 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:13:48 CET 2014: 0 of 3 tasks completed (2/2 active threads).
14/11/26 17:13:48 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:13:48 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:13:48 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:00:10.010 so far)
14/11/26 17:13:58 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:13:58 CET 2014: 0 of 3 tasks completed (2/2 active threads).
14/11/26 17:13:58 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:13:58 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:13:58 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:00:20.010 so far)
14/11/26 17:14:08 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:14:08 CET 2014: 0 of 3 tasks completed (2/2 active threads).
14/11/26 17:14:08 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:14:08 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:14:08 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:00:30.011 so far)
14/11/26 17:14:18 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:14:18 CET 2014: 1 of 3 tasks completed (2/2 active threads).
14/11/26 17:14:18 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:14:18 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:14:18 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:00:40.011 so far)
14/11/26 17:14:28 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:14:28 CET 2014: 1 of 3 tasks completed (2/2 active threads).
14/11/26 17:14:28 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:14:28 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:14:28 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:00:50.011 so far)
14/11/26 17:14:38 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:14:38 CET 2014: 1 of 3 tasks completed (2/2 active threads).
14/11/26 17:14:38 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:14:38 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:14:38 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:01:00.012 so far)
14/11/26 17:14:48 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:14:48 CET 2014: 1 of 3 tasks completed (2/2 active threads).
14/11/26 17:14:48 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:14:48 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:14:48 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:01:10.012 so far)
14/11/26 17:14:58 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:14:58 CET 2014: 2 of 3 tasks completed (1/1 active threads).
14/11/26 17:14:58 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:14:58 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:14:58 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:01:20.013 so far)
14/11/26 17:15:08 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:15:08 CET 2014: 2 of 3 tasks completed (1/1 active threads).
14/11/26 17:15:08 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:15:08 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
14/11/26 17:15:08 INFO concurrent.RunnableProgressReporter: FindJoinsForColumnFast: still running (00:01:30.013 so far)
14/11/26 17:15:18 INFO concurrent.RunnableQueueSizeReporter: Wed Nov 26 17:15:18 CET 2014 Current queue size: 0 (0 = 0.0%) -> 00:00:00.000 left
Nov 26, 2014 5:15:18 PM de.mannheim.uni.searchjoin.SearchJoin findJoinsForColumnFast INFO: Grouping data ...
Nov 26, 2014 5:15:18 PM de.mannheim.uni.searchjoin.SearchJoin findJoinsForColumnFast INFO: Preparing join ...
14/11/26 17:15:18 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:15:18 CET 2014: 49 of 60 tasks completed (3/24 active threads).
Nov 26, 2014 5:15:18 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: ------------Start data fusion----------
Nov 26, 2014 5:15:18 PM de.mannheim.uni.datafusion.DataFuser fuseCompleteTableFast INFO: Checking 60 different tables returned from search
Nov 26, 2014 5:15:18 PM de.mannheim.uni.datafusion.DataFuser fuseCompleteTableFast INFO: Fusing 60 joined tables
14/11/26 17:15:18 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:15:18 CET 2014: 0 of 3 tasks completed (3/3 active threads).
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: The new table has 234 columns
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: Start data cleaning
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: ------------------CLEANING THE TABLE------------
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.TableDataCleaner normalizeColumnUnit INFO: Unit was normalized to Time using unit s
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.TableDataCleaner normalizeColumnUnit INFO: Unit was normalized to Time using unit s
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.TableDataCleaner normalizeColumnUnit INFO: Unit was normalized to Time using unit s
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.TableDataCleaner normalizeColumnUnit INFO: Unit was normalized to Length using unit m
Nov 26, 2014 5:15:24 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Detecting duplicates with instance matching
14/11/26 17:15:24 INFO concurrent.RunnableProgressReporter: Wed Nov 26 17:15:24 CET 2014: 1003 of 10174 tasks completed (24/24 active threads).
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Instance based matching took 0s Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Detecting duplicates with label based matching Nov 26, 2014 5:15:25 PM de.mannheim.uni.schemamatching.label.TablesLabeledBasedMatcher checkForDuplicates INFO: Deciding label-based matching took 0s Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Label based matching took 0s Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Merging duplicates Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: leonidy hst io a jupiter ??????na ve vesm???ru exoplanety(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346881964267_1937.arc.gz4489233119645528253.tar.gz/89254869_0_1814994932585782769.csv.gz) was merged with ||leonidy hst io a jupiter ??????na ve vesm???ru exoplanety||leonidy hst io a jupiter ??????na ve vesm???ru exoplanety(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346882164411_2635.arc.gz3675967730166965060.tar.gz/67882374_0_3454967123217341081.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346883684005_3982.arc.gz7130272369085638690.tar.gz/11316171_0_2090899843327607602.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: 1 envolée (2001)(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860467_1346897932110_3531.arc.gz7035930346182584068.tar.gz/52519419_7_8288572171588707481.csv.gz) was merged with ||1 envolée 
(2001)(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860828_1346952487023_2644.arc.gz913742311716098958.tar.gz/14539842_7_6329470187593556161.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: score(/data/SearchJoins/Indexes/../../tables_gz/tables/19597323_0_252332043748893303.csv.gz) was merged with ||||||||score(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860567_1346913093443_433.arc.gz3507917094191148601.tar.gz/80485582_0_1878018787149333422.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860567_1346913839210_248.arc.gz3310666016584370807.tar.gz/36750178_0_6498631378793916815.csv.gz||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172239_1346997705856_3532.arc.gz4315441110134700735.tar.gz/40592413_0_7980077265697253278.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/16854953_0_3873183440426761402.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: 9 j. 10 h. 00 min.(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860467_1346897932110_3531.arc.gz7035930346182584068.tar.gz/52519419_7_8288572171588707481.csv.gz) was merged with ||9 j. 10 h. 
00 min.(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860828_1346952487023_2644.arc.gz913742311716098958.tar.gz/14539842_7_6329470187593556161.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: order(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860765_1346915798242_3096.arc.gz5391069087763079579.tar.gz/20292930_0_7743028630247252714.csv.gz) was merged with ||valuation||quantity of land(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172142_1347011627034_549.arc.gz8013296911185648347.tar.gz/12278187_0_1611828636647204521.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172142_1347011627034_549.arc.gz8013296911185648347.tar.gz/12278187_0_1611828636647204521.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: pf(/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_0_7883570562120782479.csv.gz) was merged with ||plc||rank||points||points||plc||points||plc(||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_23_2402402053693924281.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172234_1347007471075_2070.arc.gz6324562975617792783.tar.gz/29204996_1_2848314215568303173.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_23_2402402053693924281.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_15_50177848244783853.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_27_5426684217932859472.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_7_1454512229778640466.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_19_7577518932352553891.csv.gz) Nov 26, 2014 5:15:25 PM 
de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: pavel koten(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346883684005_3982.arc.gz7130272369085638690.tar.gz/11316171_0_2090899843327607602.csv.gz) was merged with ||pavel koten||pavel koten(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346882164411_2635.arc.gz3675967730166965060.tar.gz/67882374_0_3454967123217341081.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346881964267_1937.arc.gz4489233119645528253.tar.gz/89254869_0_1814994932585782769.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: album(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860493_1346904464466_1696.arc.gz3254410375847365099.tar.gz/49165520_0_3053503582764433610.csv.gz) was merged with ||album(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860493_1346906310597_2931.arc.gz9127859670084096910.tar.gz/40536253_0_9079959780514275794.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: # races(/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_27_5426684217932859472.csv.gz) was merged with ||vol.||# 
races||events||vol.(||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172258_1346985533512_688.arc.gz2997955082406093812.tar.gz/80532519_9_9095725285799743455.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_30_2863866408558085384.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/7090298_0_2807977013414582074.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347010037365_2230.arc.gz8852105782740093855.tar.gz/22682804_9_7672869823291083532.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: time(/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_22_7046816461477507843.csv.gz) was merged with ||null||time||1732-1799(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860611_1346928520024_3937.arc.gz2404704644924951471.tar.gz/42063777_0_4333027701566850773.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_29_1322455137846596762.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/49128201_0_1706516818020791243.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: 18(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860611_1346927441400_294.arc.gz4536284081282534080.tar.gz/59347859_0_3633012732420720763.csv.gz) was merged with ||age||pf||l2 divisor (final)||l2 divisor||l2 divisor (final)||l2 
divisor(||/data/SearchJoins/Indexes/../../tables_gz/tables/54853892_0_7656373944757074025.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_3_3342857431041467724.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: # races(/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_23_2402402053693924281.csv.gz) was merged with ||# races||# races||# races||# races||# races||term--years(||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_3_3653776176765425727.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_11_3211970037291386261.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_7_1454512229778640466.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_15_50177848244783853.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_19_7577518932352553891.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/52293791_0_1186837016815980.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: minuten(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860767_1346929118848_1935.arc.gz7991610128697781267.tar.gz/7790022_0_2863805980443125617.csv.gz) was merged with ||w-l||state 
share||plc||ave.||plc(||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860765_1346915798242_3096.arc.gz5391069087763079579.tar.gz/20292930_0_7743028630247252714.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_3_3653776176765425727.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172234_1347007471075_2070.arc.gz6324562975617792783.tar.gz/29204996_1_2848314215568303173.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_7_1454512229778640466.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: w-l-t(/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_2_4523783067033658164.csv.gz) was merged with ||w-l-t||min:sec||367||points||367||min:sec(||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_3_3342857431041467724.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_29_1322455137846596762.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860467_1346897932110_3531.arc.gz7035930346182584068.tar.gz/52519419_7_8288572171588707481.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/7090298_0_2807977013414582074.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860828_1346952487023_2644.arc.gz913742311716098958.tar.gz/14539842_7_6329470187593556161.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_22_7046816461477507843.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: artist(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860493_1346904464466_1696.arc.gz3254410375847365099.tar.gz/49165520_0_3053503582764433610.csv.gz) 
was merged with ||artist(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860493_1346906310597_2931.arc.gz9127859670084096910.tar.gz/40536253_0_9079959780514275794.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: prize(/data/SearchJoins/Indexes/../../tables_gz/tables/16854953_0_3873183440426761402.csv.gz) was merged with ||prize||prize(||/data/SearchJoins/Indexes/../../tables_gz/tables/19597323_0_252332043748893303.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/58293237_0_5690193541445410134.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: pa(/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_2_4523783067033658164.csv.gz) was merged with ||w-l-t(||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_0_7883570562120782479.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: pf(/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_1_6644538897976329692.csv.gz) was merged with ||pa||rank(||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_3_3342857431041467724.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/58293237_0_5690193541445410134.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: soviétique(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860828_1346952487023_2644.arc.gz913742311716098958.tar.gz/14539842_7_6329470187593556161.csv.gz) was merged with ||soviétique(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860467_1346897932110_3531.arc.gz7035930346182584068.tar.gz/52519419_7_8288572171588707481.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread 
mergeColumns INFO: rank(/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz) was merged with ||points||points||cum pts||points(||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_22_7046816461477507843.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_32_5789680583221208457.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_32_5789680583221208457.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_29_1322455137846596762.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: ????????????(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860567_1346913093443_433.arc.gz3507917094191148601.tar.gz/80485582_0_1878018787149333422.csv.gz) was merged with ||????????????||????????????(||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172239_1346997705856_3532.arc.gz4315441110134700735.tar.gz/40592413_0_7980077265697253278.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860567_1346913839210_248.arc.gz3310666016584370807.tar.gz/36750178_0_6498631378793916815.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: level 2(/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz) was merged with ||pa(||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_0_7883570562120782479.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: last name(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347010037365_2230.arc.gz8852105782740093855.tar.gz/22682804_9_7672869823291083532.csv.gz) was merged with ||last 
name(||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172258_1346985533512_688.arc.gz2997955082406093812.tar.gz/80532519_9_9095725285799743455.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: level 1(/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz) was merged with ||rank||1||seconden(||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/49128201_0_1706516818020791243.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860767_1346929118848_1935.arc.gz7991610128697781267.tar.gz/7790022_0_2863805980443125617.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: 25. listopadu 1999(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346883684005_3982.arc.gz7130272369085638690.tar.gz/11316171_0_2090899843327607602.csv.gz) was merged with ||25. listopadu 1999||25. 
listopadu 1999(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346882164411_2635.arc.gz3675967730166965060.tar.gz/67882374_0_3454967123217341081.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860791_1346881964267_1937.arc.gz4489233119645528253.tar.gz/89254869_0_1814994932585782769.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: id #(/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz) was merged with ||id #||w-l-t(||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_1_6644538897976329692.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: points(/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_27_5426684217932859472.csv.gz) was merged with ||w-l||plc||plc||plc||pa||points(||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_32_5789680583221208457.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_29_1322455137846596762.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_30_2863866408558085384.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_1_6644538897976329692.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_30_2863866408558085384.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: record #(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347010037365_2230.arc.gz8852105782740093855.tar.gz/22682804_9_7672869823291083532.csv.gz) was merged with ||record 
#||plc(||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172258_1346985533512_688.arc.gz2997955082406093812.tar.gz/80532519_9_9095725285799743455.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_11_3211970037291386261.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: f(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347009622169_1259.arc.gz6328362443779598330.tar.gz/16004273_39_4193712413914140442.csv.gz) was merged with ||f(||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1350433107011_1350487200287_5.arc.gz2288024280953615055.tar.gz/84213202_39_4913229205883945803.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: dod(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347010037365_2230.arc.gz8852105782740093855.tar.gz/22682804_9_7672869823291083532.csv.gz) was merged with ||age or dob||max avg (win out)||national ranking||member since||november 19||min avg (lose out)||jaar||max avg (lose out)||average||min avg (win out)||born||totwgt||max avg (win out)||year||???????????? 
???????????????||2010||death||max avg (lose out)||birth||built||1992||min avg (win out)||min avg (lose out)||#||average(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860807_1346934516794_579.arc.gz580451016046253939.tar.gz/7527864_5_2268997509613351337.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/7090298_0_2807977013414582074.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860596_1346938152839_2219.arc.gz6876240675716612747.tar.gz/8220519_0_7663235719041797015.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172186_1346996335310_2457.arc.gz8381037101601569570.tar.gz/58854521_1_4115309309301812578.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860767_1346929118848_1935.arc.gz7991610128697781267.tar.gz/7790022_0_2863805980443125617.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/54853892_0_7656373944757074025.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/7090298_0_2807977013414582074.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860767_1346929632585_1974.arc.gz1780917889169822432.tar.gz/41005541_0_7301356455172572153.csv.gz||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172239_1346997705856_3532.arc.gz43
15441110134700735.tar.gz/40592413_0_7980077265697253278.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860611_1346927441400_294.arc.gz4536284081282534080.tar.gz/59347859_0_3633012732420720763.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860807_1346933801036_226.arc.gz9204224743296759794.tar.gz/10147506_0_8746295800068522443.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860807_1346933801036_226.arc.gz9204224743296759794.tar.gz/10147506_0_8746295800068522443.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860467_1346895296582_1027.arc.gz4247041625054366969.tar.gz/18235013_0_5084549912620567526.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860611_1346927441400_294.arc.gz4536284081282534080.tar.gz/59347859_0_3633012732420720763.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/52886071_0_1159965426153092696.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: ????????????????????? ????????????????????????.(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860567_1346913839210_248.arc.gz3310666016584370807.tar.gz/36750178_0_6498631378793916815.csv.gz) was merged with ||????????????????????? ????????????????????????.||?????????????????????????????? 
????????????????????????.(||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1346981172239_1346997705856_3532.arc.gz4315441110134700735.tar.gz/40592413_0_7980077265697253278.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860567_1346913093443_433.arc.gz3507917094191148601.tar.gz/80485582_0_1878018787149333422.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: pf(/data/SearchJoins/Indexes/../../tables_gz/tables/18735579_2_4523783067033658164.csv.gz) was merged with ||rank(||/data/SearchJoins/Indexes/../../tables_gz/tables/19597323_0_252332043748893303.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: domestic(/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1350433107011_1350487200287_5.arc.gz2288024280953615055.tar.gz/84213202_39_4913229205883945803.csv.gz) was merged with ||domestic(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347009622169_1259.arc.gz6328362443779598330.tar.gz/16004273_39_4193712413914140442.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: details(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860767_1346929632585_1974.arc.gz1780917889169822432.tar.gz/41005541_0_7301356455172572153.csv.gz) was merged with ||details(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860767_1346929118848_1935.arc.gz7991610128697781267.tar.gz/7790022_0_2863805980443125617.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: playoff status 
(unofficial)(/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz) was merged with ||playoff status (unofficial)(||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: #(/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860493_1346904464466_1696.arc.gz3254410375847365099.tar.gz/49165520_0_3053503582764433610.csv.gz) was merged with ||1||level 2||15||#||15(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860611_1346927441400_294.arc.gz4536284081282534080.tar.gz/59347859_0_3633012732420720763.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz||/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1350433107011_1350487200287_5.arc.gz2288024280953615055.tar.gz/84213202_39_4913229205883945803.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860493_1346906310597_2931.arc.gz9127859670084096910.tar.gz/40536253_0_9079959780514275794.csv.gz||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347009622169_1259.arc.gz6328362443779598330.tar.gz/16004273_39_4193712413914140442.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: partei(/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1350433107057_1350490847678_731.arc.gz3909794927417317263.tar.gz/45057392_0_5477358415784204804.csv.gz) was merged with ||partei(en)(||/data/SearchJoins/Indexes/../../tables_gz/tables/9335304_9_4806972770207214739.csv.gz) Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: 
plc(/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_22_7046816461477507843.csv.gz) was merged with ||page(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346876860807_1346934516794_579.arc.gz580451016046253939.tar.gz/7527864_5_2268997509613351337.csv.gz)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: points(/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_11_3211970037291386261.csv.gz) was merged with ||points||plc||points(||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_3_3653776176765425727.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_15_50177848244783853.csv.gz||/data/SearchJoins/Indexes/../../tables_gz/tables/72738450_19_7577518932352553891.csv.gz)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: city(/data/SearchJoins/Indexes/../../tables_gz/tables/15251149_0_3849441424472084861.csv.gz) was merged with ||city(||/data/SearchJoins/Indexes/../../tables_gz/tables/69109807_0_1608852235502255382.csv.gz)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DuplicateResolver$DuplicateResolverThread mergeColumns INFO: nc(/data/SearchJoins/Indexes/../../html_tables/tar/common-crawl_parse-output_segment_1350433107011_1350487200287_5.arc.gz2288024280953615055.tar.gz/84213202_39_4913229205883945803.csv.gz) was merged with ||nc(||/data/SearchJoins/Indexes/../../htmlTablesTarCompleted/common-crawl_parse-output_segment_1346981172231_1347009622169_1259.arc.gz6328362443779598330.tar.gz/16004273_39_4193712413914140442.csv.gz)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Merging took 0s
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Removeing null columns
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: time(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: jv score(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: min:sec(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: no(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: notes(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: memid(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: time(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: date(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: b. p.(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner filterColumnsByColumnDensity INFO: Column removed because of NULL values: home(density is 1.0)
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.TableDataCleaner cleanTable INFO: Removing null rows
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: Time for cleaning the table: 6967
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: The new table after data cleaining has 100 columns
Nov 26, 2014 5:15:25 PM de.mannheim.uni.datafusion.DataFuser fuseQueryTableWithEntityTables INFO: Time for data fusion: 6971
Nov 26, 2014 5:15:25 PM de.mannheim.uni.searchjoin.SearchJoin searchJoinForTable INFO: Search join ended: 107.43