Skip to content

Commit 4de7593

Browse files
committed
Modify UnicodeDecodeError text in Python 2.x
1 parent d818133 commit 4de7593

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

‎ch05/classify.py‎

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,9 @@ def prepare_sent_features():
5454
ifnottext:
5555
meta[pid]['AvgSentLen'] =meta[pid]['AvgWordLen'] =0
5656
else:
57-
text=text.decode('utf-8')
57+
fromplatformimportpython_version
58+
ifpython_version().startswith('2'):
59+
text=text.decode('utf-8')
5860
sent_lens= [len(nltk.word_tokenize(
5961
sent)) forsentinnltk.sent_tokenize(text)]
6062
meta[pid]['AvgSentLen'] =np.mean(sent_lens)

0 commit comments

Comments
(0)