1. use
codecs module to read and write files:import codecsf = codecs.open('file.txt', mode="w", encoding="utf-8-sig")2. Don't mix strings and unicode. (Always prefix "strings" with
u (e.g. u"hello world"))3. Start your python script with
# -*- coding: utf-8 -*- in case you might have any unicode characters in your code or in hardcoded "strings" in your script.4. Also read http://lobstertech.com/python_unicode.html and http://www.evanjones.ca/python-utf8.html for more info on using UTF-8/unicode with Python.
Other sources:
http://docs.python.org/howto/unicode.html
http://www.carlosble.com/2010/12/understanding-python-and-unicode/
Nice post!
ReplyDelete