Summary

Audit all strings in djblets and convert to future.unicode_literals

Review Request #4944 — Created Nov. 12, 2013 and submitted Nov. 13, 2013, 2:17 a.m.

Information

Owner

david

Repository

Djblets

Branch

master

Bugs

Depends On

Reviewers

Groups

djblets

People

Description

Audit all strings in djblets and convert to __future__.unicode_literals

This change represents an audit of all of the strings in the djblets codebase.
Where the strings were actually binary data (specifically with our treatment of
settings.EMAIL_HOST_USER and settings.EMAIL_HOST_PASSWORD, and anything passed
into sha1() or md5()), I've made it use the bytes type. Otherwise, these are
now unicode objects by virtue of importing unicode_literals from __future__.

There's also a bunch of fixes to our handling of cache keys, especially those
for large data, much more consistent. We always keep the cache key as a unicode
object and create derivative keys (like the chunk keys for large data) with
that. At the last moment, before calling cache.get() or cache.set(), we pass
that to make_cache_key, which will shorten it if necessary using md5, and then
encode to utf-8 (since not all cache backends support the unicode object as
keys).

Testing Done


Verified my assumptions about treatment of unicode vs. str in getattr,

hasattr, delattr, __getattr__, __hasattr__, __delattr__, hash, regular

  expressions, and memcache keys.
Ran djblets unit tests
Ran Review Board unit tests

Issues

Description	From	Last Updated
Wrong side of the '	chipx86	Nov. 12, 2013, 5:12 p.m.
Does the first string need a 'u'?	chipx86	Nov. 12, 2013, 5:12 p.m.
Does this need a u'?	chipx86	Nov. 12, 2013, 5:12 p.m.
Should probably be r''	chipx86	Nov. 12, 2013, 5:13 p.m.
Shouldn't be necessary to put quotes around this. %r should take care of that.	chipx86	Nov. 13, 2013, 2:16 a.m.

Change Summary:

Fix the one place I seem to have misplaced the u ('u..'). Grepping indicates that I didn't make other mistakes.

Diff:

Revision 2 (+1645 -1651)

Show changes

	djblets/__init__.py
	djblets/settings.py
	djblets/auth/forms.py
	djblets/auth/util.py
	djblets/auth/views.py
	djblets/datagrid/grids.py
	djblets/datagrid/tests.py
	djblets/datagrid/templatetags/datagrid.py
	66 more

djblets/__init__.py (Diff revision 1)
The issue has been resolved. Show all issues
```
Wrong side of the '
```
djblets/datagrid/tests.py (Diff revision 1)
The issue has been resolved. Show all issues
```
Does the first string need a 'u'?
```
djblets/extensions/errors.py (Diff revision 1)
The issue has been resolved. Show all issues
```
Does this need a u'?
```
djblets/extensions/resources.py (Diff revision 1)
The issue has been resolved. Show all issues
```
Should probably be r''
```

Change Summary:

Fix noted issues.

Diff:

Revision 3 (+1645 -1651)

Show changes

	djblets/__init__.py
	djblets/settings.py
	djblets/auth/forms.py
	djblets/auth/util.py
	djblets/auth/views.py
	djblets/datagrid/grids.py
	djblets/datagrid/tests.py
	djblets/datagrid/templatetags/datagrid.py
	66 more

Change Summary:

Add unicode_literals, undo the "u" prefixes, and fix a few bugs I found in the handling of strings in the large-data caching.

Summary:

Go through and clarify types of all strings in djblets.

Audit all strings in djblets and convert to __future__.unicode_literals

Description:

~		Go through and clarify types of all strings in djblets.
	~	Audit all strings in djblets and convert to `__future__.unicode_literals`

		This change represents an audit of all of the strings in the djblets codebase.
		Where the strings were actually binary data (specifically with our treatment of
~		`settings.EMAIL_HOST_USER` and `settings.EMAIL_HOST_PASSWORD`, and anything that
~		gets passed into `sha1()` or `md5()`), I've made it use the `bytes` type. Otherwise,
~		I've changed all of the "text" strings to be `unicode` literals instead of `str`.
	~	`settings.EMAIL_HOST_USER` and `settings.EMAIL_HOST_PASSWORD`, and anything passed
	~	into `sha1()` or `md5()`), I've made it use the `bytes` type. Otherwise, these are
	~	now `unicode` objects by virtue of importing `unicode_literals` from `__future__`.

~		In a future change, I'll import `unicode_literals` from `__future__` in every file
~		and remove the `u` prefix. This change was a necessary intermediate step to make
~		sure I looked at every single string and made a decision about it.
	~	There's also a bunch of fixes to our handling of cache keys, especially those
	~	for large data, much more consistent. We always keep the cache key as a unicode
	~	object and create derivative keys (like the chunk keys for large data) with
	+	that. At the last moment, before calling `cache.get()` or `cache.set()`, we pass
	+	that to `make_cache_key`, which will shorten it if necessary using md5, and then
	+	encode to utf-8 (since not all cache backends support the unicode object as
	+	keys).

Diff:

Revision 4 (+344 -194)

Show changes

	djblets/__init__.py
	djblets/settings.py
	djblets/auth/forms.py
	djblets/auth/util.py
	djblets/auth/views.py
	djblets/datagrid/grids.py
	djblets/datagrid/tests.py
	djblets/datagrid/templatetags/datagrid.py
	77 more

Ship it!

```
Looks fine. Just one thing.
```

djblets/util/fields.py (Diff revision 4)

The issue has been resolved. Show all issues

Shouldn't be necessary to put quotes around this. %r should take care of that.

Status:: Completed
Change Summary:: Pushed to master (52da427).