BQ数据集问题
收藏
快速回复
BQ数据集问题
收藏
快速回复

BQ数据的train数据集有问题。pandas读取,说3行读出4行了。。。。。

BQ_train = pd.read_csv('train/BQ/train', delimiter="\t", header=None)




---------------------------------------------------------------------------ParserError                               Traceback (most recent call last) in 
      1 LCQMC_train = pd.read_csv('train/LCQMC/train', delimiter = '\t', header=None)
      2 OPPO_train = pd.read_csv('train/OPPO/train', delimiter = '\t', header=None)
----> 3 BQ_train = pd.read_csv('train/BQ/train', delimiter="\t", header=None)
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/pandas/io/parsers.py in read_csv(filepath_or_buffer, sep, delimiter, header, names, index_col, usecols, squeeze, prefix, mangle_dupe_cols, dtype, engine, converters, true_values, false_values, skipinitialspace, skiprows, skipfooter, nrows, na_values, keep_default_na, na_filter, verbose, skip_blank_lines, parse_dates, infer_datetime_format, keep_date_col, date_parser, dayfirst, cache_dates, iterator, chunksize, compression, thousands, decimal, lineterminator, quotechar, quoting, doublequote, escapechar, comment, encoding, dialect, error_bad_lines, warn_bad_lines, delim_whitespace, low_memory, memory_map, float_precision)
    686     )
    687 
--> 688     return _read(filepath_or_buffer, kwds)
    689 
    690 
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/pandas/io/parsers.py in _read(filepath_or_buffer, kwds)
    458 
    459     try:
--> 460         data = parser.read(nrows)
    461     finally:
    462         parser.close()
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/pandas/io/parsers.py in read(self, nrows)
   1196     def read(self, nrows=None):
   1197         nrows = _validate_integer("nrows", nrows)
-> 1198         ret = self._engine.read(nrows)
   1199 
   1200         # May alter columns / col_dict
/opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages/pandas/io/parsers.py in read(self, nrows)
   2155     def read(self, nrows=None):
   2156         try:
-> 2157             data = self._reader.read(nrows)
   2158         except StopIteration:
   2159             if self._first_chunk:
pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader.read()
pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader._read_low_memory()
pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader._read_rows()
pandas/_libs/parsers.pyx in pandas._libs.parsers.TextReader._tokenize_rows()
pandas/_libs/parsers.pyx in pandas._libs.parsers.raise_parser_error()
ParserError: Error tokenizing data. C error: Expected 3 fields in line 20746, saw 4
0
收藏
回复
全部评论(5)
时间顺序
JavaRoom
#2 回复于2021-09

截图:

0
回复
努力成为NLPer
#3 回复于2021-09

别用pandas读

0
回复
JavaRoom
#4 回复于2021-09
别用pandas读

为啥?

0
回复
努力成为NLPer
#5 回复于2021-09
为啥?

你不是报错吗 直接open()打开就行了呗

0
回复
JavaRoom
#6 回复于2021-09
你不是报错吗 直接open()打开就行了呗

love pandas , daxiongmao is very good

0
回复
在@后输入用户全名并按空格结束,可艾特全站任一用户