Index of /largescaleproductcorpus/data/
htmlCorpus/ 24-Sep-2018 15:47 -
languagemodelling/ 26-Feb-2021 17:02 -
swc/ 06-Oct-2020 14:47 -
trainingSubsets/ 19-Dec-2018 11:22 -
v2/ 04-May-2022 12:16 -
v2020/ 23-Aug-2021 15:07 -
v2_nonnorm/ 12-Feb-2020 13:30 -
wdc-block/ 12-May-2023 15:36 -
wdc-products/ 13-Jul-2023 13:23 -
GS_ListingPages.txt 22-Oct-2018 13:26 66K
amazon_training.json.gzip 20-Aug-2019 08:57 5M
cat_gs_Evaluation.txt 26-Oct-2018 12:37 167K
categories_clusters_testing.json.gzip 20-Aug-2019 08:57 7M
categories_clusters_training.json.gzip 20-Aug-2019 08:57 30M
categories_gold_standard_offers.json.gzip 20-Aug-2019 08:57 6M
categories_offers_en_clusters.csv.gzip 20-Aug-2019 08:57 33M
gs_cameras.txt 18-Mar-2019 15:05 134K
gs_computers.txt 18-Mar-2019 15:05 145K
gs_shoes.txt 18-Mar-2019 15:05 153K
gs_specTables.zip 07-Nov-2018 10:49 568K
gs_watches.txt 18-Mar-2019 15:05 146K
idclusters.json.gz 13-Dec-2018 11:02 137M
nonnormalizedOffers.json.gz 10-Feb-2020 10:23 8G
nonnormalizedOffers_english.json.gz 10-Feb-2020 13:55 5G
offers.json.gz 13-Dec-2018 11:05 6G
offers_english.json.gz 13-Dec-2018 11:06 4G
pages_indexinfo.txt 29-Nov-2018 17:53 8G
specTables.json.gz 27-Dec-2018 10:32 2G
specificationTables.zip 07-Nov-2018 12:57 131M