Bugs occurred in other datasets #2

Cloudcatcher888 · 2019-09-18T08:42:52Z

I get a bug in T-mall datasets called:

(base) wzk@ddst:~/work/Sets2Sets$ python Sets2Sets.py ./data/alibaba_history.csv ./data/alibaba_future.csv 1 2 1
start dictionary generation...
{'MATERIAL_NUMBER': 9531}
# dimensions of final vector: 9531 | 2962
finish dictionary generation*****
num of vectors having entries more than 1: 16462
num of vectors having entries more than 1: 15275
Traceback (most recent call last):
  File "Sets2Sets.py", line 990, in <module>
    main(sys.argv)
  File "Sets2Sets.py", line 955, in main
    codes_freq = get_codes_frequency_no_vector(data_chunk[past_chunk],input_size,data_chunk[future_chunk].keys())
  File "Sets2Sets.py", line 935, in get_codes_frequency_no_vector
    for idx in X[pid]:
KeyError: '371250'

Have anyone met this before? I'd be really appreciated if anyone can help.

The text was updated successfully, but these errors were encountered:

HaojiHu · 2019-09-18T20:10:54Z

The variable pid goes out of the bound of X. Try to make sure there is a pid 371250 in the data_chunk[past_chunk]. I had some preprocess to make sure the keys in data_chunk[future_chunk].keys() are contained in data_chunk[past_chunk].keys(). Thanks for reporting this. I will try to fix it later.

Cloudcatcher888 closed this as completed Sep 18, 2019

Cloudcatcher888 reopened this Sep 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugs occurred in other datasets #2

Bugs occurred in other datasets #2

Cloudcatcher888 commented Sep 18, 2019 •

edited

HaojiHu commented Sep 18, 2019 •

edited

Bugs occurred in other datasets #2

Bugs occurred in other datasets #2

Comments

Cloudcatcher888 commented Sep 18, 2019 • edited

HaojiHu commented Sep 18, 2019 • edited

Cloudcatcher888 commented Sep 18, 2019 •

edited

HaojiHu commented Sep 18, 2019 •

edited