Summary

Add credentials check tool

Review Request #10223 — Created Oct. 11, 2018 and discarded Dec. 2, 2020, 7:14 p.m.

Information

Owner

ammar

Repository

ReviewBot

Branch

master

Bugs

Depends On

Reviewers

Groups

reviewbot, students

People

Description

Previously, Review Bot did not check for credentials that may have been
accidentally included in the commit. A human reviewer would have to
look out for them, but we hoped to move more of this task's burden
to Review Bot.

A new Credentials Check tool has been added to Review Bot which
looks for various key files, other sensitive files and inline embedded
AWS credentials to make sure these are not pushed to the repository.

Testing Done

Manual tests (correctly finds and creates issues on lines with the
credentials, or marks the first line of a file type (including
file types specified from options tab) that should not have been
included e.g. .pem files).

Issues

Description	From	Last Updated
Can you wrap your description and testing done at 72 Chars?	brennie	Oct. 16, 2018, 5:36 p.m.
Is this a WIP? Your description has "WIP: Add options..." If this is WIP please put it in the summary	brennie	Oct. 16, 2018, 6:27 p.m.
In your change description: "ReviewBot" -> "Review Bot" (3x)	david	Nov. 27, 2018, 4:04 p.m.
E501 line too long (83 > 79 characters)	reviewbot	Oct. 13, 2018, 12:11 p.m.
E501 line too long (84 > 79 characters)	reviewbot	Oct. 13, 2018, 12:11 p.m.
E501 line too long (85 > 79 characters)	reviewbot	Oct. 13, 2018, 12:11 p.m.
E501 line too long (80 > 79 characters)	reviewbot	Oct. 13, 2018, 12:11 p.m.
E501 line too long (84 > 79 characters)	reviewbot	Oct. 13, 2018, 12:11 p.m.
E501 line too long (89 > 79 characters)	reviewbot	Oct. 13, 2018, 12:11 p.m.
typo: "credntialscheck"	brennie	Oct. 13, 2018, 12:10 p.m.
typo: "credntialscheck"	brennie	Oct. 13, 2018, 12:11 p.m.
Missing module-level docstring	brennie	Oct. 13, 2018, 12:13 p.m.
Module imports should be formatted as: from __future__ import ... # Python STDLib imports # 3rd party imports # Imports …	brennie	Oct. 13, 2018, 12:15 p.m.
Instead of having multiple credential regexes, we can make this into a single regular expression: compiled_credential_pattern = re.compile( '\|'.join( '(%s)' …	brennie	Oct. 13, 2018, 12:16 p.m.
Single quotes here and throughout	brennie	Oct. 13, 2018, 12:17 p.m.
Missing trailing comma. This regex doesn't do what you want it to becuase of the leading [, which makes everything …	brennie	Oct. 13, 2018, 12:18 p.m.
Instead of doing this here, we should do it in CredentialsCheckTool.__init__ so that theyre not sitting here taking up memory …	brennie	Oct. 13, 2018, 12:22 p.m.
Docstrings should be of the form: """Single line summary. Multi-line description. """	brennie	Oct. 13, 2018, 12:25 p.m.
We use the Oxford comma, so there should be a comma after private keys.	brennie	Oct. 13, 2018, 12:26 p.m.
How about just "Review a single file."?	brennie	Oct. 13, 2018, 2:10 p.m.
Blank line between these.	brennie	Oct. 13, 2018, 12:36 p.m.
This will not detect files named id_rsa becuase it is not an extension. You will need a separate set of …	brennie	Oct. 13, 2018, 12:52 p.m.
We should word this as "may be a security risk" because if its a public key --PEM files can be …	brennie	Oct. 13, 2018, 12:36 p.m.
Comments should be complete sentences: they should begin with a capital letter and end with a period.	brennie	Oct. 13, 2018, 12:37 p.m.
What exceptions are we hoping to catch? We should be very specific about what we expect so that other exceptions …	brennie	Oct. 13, 2018, 12:46 p.m.
Blank line between these.	brennie	Oct. 13, 2018, 12:51 p.m.
Since pattern is a compiled regular expression, you can just do pattern.match(line)	brennie	Oct. 13, 2018, 12:47 p.m.
You're going to need to add a entrypoint for your tool.	brennie	Oct. 13, 2018, 12:49 p.m.
:file:`.pem` :file:`id_rsa`	brennie	Oct. 13, 2018, 12:51 p.m.
How about: This tool is built into ReviewBot. There is no separate installation step required.	brennie	Oct. 13, 2018, 12:51 p.m.
Typo: "credntialscheck"	brennie	Oct. 13, 2018, 12:50 p.m.
One more blank line here.	brennie	Oct. 16, 2018, 5:37 p.m.
Can you add a docstring here?	brennie	Oct. 16, 2018, 5:46 p.m.
No blank line here.	brennie	Oct. 16, 2018, 5:46 p.m.
Can you wrap these in parens to make it clear that this is supposed to be multiple lines? e.g. python …	brennie	Oct. 16, 2018, 5:47 p.m.
E128 continuation line under-indented for visual indent	reviewbot	Oct. 16, 2018, 7:03 p.m.
It looks like these wouldn't catch cases where the value was enclosed in quotes?	david	Oct. 18, 2018, 1:30 p.m.
Let's put one per line and sort them all alphabetically.	david	Oct. 18, 2018, 1:30 p.m.
Should probably be "Including this file ..." (files -> file)	david	Oct. 18, 2018, 1:31 p.m.
We should be able to use regex matching against bytestrings, so we can skip the detection/decoding here. We just need …	david	Oct. 18, 2018, 4:12 p.m.
Do we want to use .search() instead of .match()?	david	Oct. 18, 2018, 4:11 p.m.
I feel like we should be more verbose about what the problem might be (for example, "Potential disclosure of private …	david	Oct. 18, 2018, 4:11 p.m.
A link to this needs to be added to docs/reviewbot/tools/index.rst	david	Oct. 18, 2018, 1:58 p.m.
This reads a little funky. How about "Improper credentials can include things such as AWS keys hardcoded in source or …	david	Oct. 18, 2018, 2 p.m.
E124 closing bracket does not match visual indentation	reviewbot	Oct. 18, 2018, 4:33 p.m.
E124 closing bracket does not match visual indentation	reviewbot	Oct. 18, 2018, 4:33 p.m.
Can you insert this in alphabetical order?	brennie	Oct. 25, 2018, 5:31 p.m.
Can you insert this in alphabetical order?	brennie	Oct. 25, 2018, 5:31 p.m.
These regexes won't work in a few cases: Shell scripts with AWS_SECRET_KEY=... i.e., without quotes. Single-quoted string values We can …	brennie	Oct. 25, 2018, 5:25 p.m.
Single quotes.	brennie	Oct. 25, 2018, 5:31 p.m.
Single quotes around AWS_SECRET_KEY	brennie	Oct. 25, 2018, 5:31 p.m.
Single quotes here	alextechcc	Oct. 25, 2018, 6:21 p.m.
Trailing space, but it should also have a period. Also, comma after e.g..	brennie	Oct. 30, 2018, 5:21 p.m.
Missing args/kwargs	brennie	Oct. 25, 2018, 5:32 p.m.
https://docs.python.org/2/library/os.path.html#os.path.splitext	brennie	Oct. 30, 2018, 5:57 p.m.
.iteritems() is Python2 only. You'll want to do: import six # ... for name, pattern in six.iteritems(self.compiled_re):	brennie	Oct. 27, 2018, 9:45 p.m.
This should line up with the string above, e.g. ('... ' '... '),	brennie	Oct. 27, 2018, 9:45 p.m.
RBTools depends on six but if we're using it directly, we should have our own dependency on it. (I mention …	brennie	Oct. 25, 2018, 5:43 p.m.
Trailing whitespace.	brennie	Oct. 25, 2018, 5:43 p.m.
Can you insert this in alphabetical order?	brennie	Oct. 25, 2018, 5:45 p.m.
Can you insert this in alphabetical order?	brennie	Oct. 27, 2018, 9:45 p.m.
six is a third-party library, so it should go in it's own "section": import re import six from reviewbot.tools import …	david	Oct. 30, 2018, 5:57 p.m.
Formatting here could be a little nicer. If you put the parens on their own lines, then the strings will …	david	Oct. 30, 2018, 6:05 p.m.
This should use six.iteritems. Also, this can use a dict comprehension to do it all in one go: super(CredentialsCheckTool, self).__init__() …	david	Oct. 30, 2018, 6:06 p.m.
In this case I think it's probably better to just ignore the line length warning.	david	Oct. 30, 2018, 6:07 p.m.
ReviewBot -> Review Bot	david	Oct. 30, 2018, 6:07 p.m.
ReviewBot -> Review Bot	ilaw	Nov. 1, 2018, 6:08 p.m.
E501 line too long (88 > 79 characters)	reviewbot	Oct. 30, 2018, 6:15 p.m.
Should be in alphabetical order?	ilaw	Nov. 1, 2018, 6:08 p.m.
Will this fit as: for risk_name, pattern in six.iteritems( self._compiled_re): # ...	brennie	Nov. 2, 2018, 3:48 p.m.
E501 line too long (88 > 79 characters)	reviewbot	Nov. 1, 2018, 6:15 p.m.
F821 undefined name 'compiled_pattern'	reviewbot	Nov. 2, 2018, 3:51 p.m.
E501 line too long (88 > 79 characters)	reviewbot	Nov. 2, 2018, 3:50 p.m.
E501 line too long (88 > 79 characters)	reviewbot	Nov. 2, 2018, 3:51 p.m.
Add another blank line here.	david	Nov. 27, 2018, 4:06 p.m.
E501 line too long (88 > 79 characters)	reviewbot	Nov. 28, 2018, 7:38 p.m.

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (83 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (84 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (85 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (80 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (84 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (89 > 79 characters)
```

Commit:

ac44d7ecf125fc206d0459a16ce2b3c81d7bf6a7

b872485315ddf3a609b1cd467fc1e958a24586a4

Diff:

Revision 2 (+86)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

README.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
typo: "credntialscheck"
```
bot/README.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
typo: "credntialscheck"
```
bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Missing module-level docstring
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Module imports should be formatted as:

from __future__ import ...

# Python STDLib imports

# 3rd party imports

# Imports from this package


e.g.

from __future__ import unicode_literals

import re

import chardet

from reviewbot.tools import Tool

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Instead of having multiple credential regexes, we can make this into a single regular expression:

compiled_credential_pattern = re.compile(
    '|'.join(
        '(%s)' % pattern
        for pattern in credential_patterns
    )
)

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Single quotes here and throughout
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Missing trailing comma.
This regex doesn't do what you want it to becuase of the leading [, which makes everything up to ] a character this regex will match.
e.g.

>>> import re
>>> x = re.compile(r"[AWS_SECRET_KEY\s*=\s*[A-Za-z0-9/+=]{40}")
>>> m = x.match('SSSSSSSSSSSS  = %s' % 'A' * 40)
>>> bool(m)
True

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Instead of doing this here, we should do it in CredentialsCheckTool.__init__ so that theyre not sitting here taking up memory if the tool isn't used. e.g.

class CredentialsCheckTool(Tool):
    def __init__(self, *args, **kwargs):
        super(CredentialsCheckTool, self).__init__(*args, **kwargs)
        self.compiled_re = [
            re.compile(regex)
            for regex in credential_patterns
        ]

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Docstrings should be of the form:

"""Single line summary.

Multi-line description.
"""

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
We use the Oxford comma, so there should be a comma after private keys.
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been dropped. Show all issues

How about just "Review a single file."?

ammar Oct. 13, 2018, 1:17 p.m.

All tools use the "Perform a review of a single file" verbiage. Do we still want to change this?

brennie Oct. 13, 2018, 2:15 p.m.
```
Nope thats fine!
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Blank line between these.
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

This will not detect files named id_rsa becuase it is not an extension. You will need a separate set of file names (id_rsa, id_dsa, id_ecdsa) versus file extensions (p12, pem, ppk, key).

ammar Oct. 13, 2018, 1:17 p.m.


>>> file_type = "id_rsa".lower().split(".")[-1]
>>> file_type
'id_rsa'
>>> file_type = "my.key".lower().split(".")[-1]
>>> file_type
'key'


file_type actually captures both

brennie Oct. 13, 2018, 2:15 p.m.

Ok ignore this then :)

BTW you can use os.path.splitext

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

We should word this as "may be a security risk" because if its a public key  --PEM files can be encoded public keys -- it totally isn't.

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Comments should be complete sentences: they should begin with a capital letter and end with a period.

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

I would organize this as:

try:
    encoding = chardet.detect(
        f.patched_file_contents)['encoding']

    if encoding:
        contents = f.patched_file_contents.decode(
            encoding, 'strict')
    else:
        # We can't do any more for this file.
        return
except (TypeError, UnicodeError, ValueError):
    return

lines = contents.split('\n')

for line_number, line in enumerate(lines, 1):
    for pattern in compiled_credential_patterns:
        if pattern.match(line):
            f.comment(...)


This lets us use explicit control flow (return) when we are done instead of keeping track of boolean state.

ammar Oct. 13, 2018, 1:17 p.m.
```
Thanks! This is much cleaner.
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

Does bytes.decode(encoding, 'strict') work in Python 3?

ammar Oct. 13, 2018, 1:17 p.m.

Seems to work fine.

Python 3.5.2 (default, Nov 23 2017, 16:37:01) 
[GCC 5.4.0 20160609] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> b'100000000'.decode("ascii", "strict")
'100000000'

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

What exceptions are we hoping to catch? We should be very specific about what we expect so that other exceptions that we aren't expecting aren't also captured.

Looking at the source of chardet.detect, it looks like it only raises TypeError. bytes.decode(..., 'strict') seems to raise ValueError and UnicodeError.

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Blank line between these.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Since pattern is a compiled regular expression, you can just do pattern.match(line)
```
bot/setup.py (Diff revision 2)
The issue has been resolved. Show all issues
```
You're going to need to add a entrypoint for your tool.
```
docs/reviewbot/tools/credentialscheck.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
:file:`.pem`
:file:`id_rsa`
```

docs/reviewbot/tools/credentialscheck.rst (Diff revision 2)

The issue has been resolved. Show all issues

How about:

This tool is built into ReviewBot. There is no separate installation step required.

extension/README.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
Typo: "credntialscheck"
```

Commit:

b872485315ddf3a609b1cd467fc1e958a24586a4

b988408dc3c6a168c18e64b00c89901d5c612fa7

Diff:

Revision 3 (+101)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

The issue has been resolved. Show all issues

Can you wrap your description and testing done at 72 Chars?

The issue has been resolved. Show all issues

Is this a WIP? Your description has "WIP: Add options..."

If this is WIP please put it in the summary

bot/reviewbot/tools/credentials_check.py (Diff revision 3)
The issue has been resolved. Show all issues
```
One more blank line here.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 3)
The issue has been resolved. Show all issues
```
Can you add a docstring here?
```
bot/reviewbot/tools/credentials_check.py (Diff revision 3)
The issue has been resolved. Show all issues
```
No blank line here.
```

bot/setup.py (Diff revision 3)

The issue has been resolved. Show all issues

Can you wrap these in parens to make it clear that this is supposed to be multiple lines?

e.g.

python ('credentialscheck = ' '...'),

This will help in not accidentally adding a comma there in the future.

Description:

~		Previously, ReviewBot did not check for credentials that may have been accidentally included in the commit. A human reviewer would have to look out for them, but we hoped to move more of this task's burden to ReviewBot.
	~	Previously, ReviewBot did not check for credentials that may have been
	+	accidentally included in the commit. A human reviewer would have to
	+	look out for them, but we hoped to move more of this task's burden
	+	to ReviewBot.

~		A new Credentials Check tool has been added to ReviewBot which looks for various key files, other sensitive files and inline embedded AWS credentials to make sure these are not pushed to the repository.
~
~		WIP: Add options to let users enter their own files to ignore.
	~	A new Credentials Check tool has been added to ReviewBot which
	~	looks for various key files, other sensitive files and inline embedded
	~	AWS credentials to make sure these are not pushed to the repository.

Testing Done:

~		Manual tests (correctly finds and creates issues on lines with the credentials, or marks the first line of a file type that should not have been included e.g. .pem files)
	~	Manual tests (correctly finds and creates issues on lines with the
	+	credentials, or marks the first line of a file type (including
	+	file types specified from options tab) that should not have been
	+	included e.g. .pem files).

Commit:

b988408dc3c6a168c18e64b00c89901d5c612fa7

ac317d290bcc069e970f67365e879db2e3affba9

Diff:

Revision 4 (+120)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 4)
The issue has been resolved. Show all issues
```
E128 continuation line under-indented for visual indent
```

Commit:

ac317d290bcc069e970f67365e879db2e3affba9

d5e3af9386608400281a3eb40045da461682af37

Diff:

Revision 5 (+120)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
It looks like these wouldn't catch cases where the value was enclosed in quotes?
```
bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Let's put one per line and sort them all alphabetically.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Should probably be "Including this file ..." (files -> file)
```

bot/reviewbot/tools/credentials_check.py (Diff revision 5)

The issue has been resolved. Show all issues

We should be able to use regex matching against bytestrings, so we can skip the detection/decoding here. We just need to make sure that the patterns use br'...'

ammar Oct. 18, 2018, 4:07 p.m.

What does br mean/achieve? Cannot seem to find any documentation on it.
Also, if we skip decoding, how can we mitigate risk of running the regex against image files etc?

david Oct. 18, 2018, 4:13 p.m.

An r prefix means "raw string", where a \ character is actually a \ (and doesn't need to be escaped with \\). That's generally useful for regexes that use a lot of \ characters. A b prefix means a bytestring (as opposed to unicode text).

As far as binary files and such, we currently don't worry about them (because of the way we store diffs). Even when we add more significant binary file support, I don't think you will need to care here.

bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Do we want to use .search() instead of .match()?
```

bot/reviewbot/tools/credentials_check.py (Diff revision 5)

The issue has been resolved. Show all issues

I feel like we should be more verbose about what the problem might be (for example, "Potential disclosure of private Amazon AWS keys")

docs/reviewbot/tools/credentialscheck.rst (Diff revision 5)
The issue has been resolved. Show all issues
```
A link to this needs to be added to docs/reviewbot/tools/index.rst
```

docs/reviewbot/tools/credentialscheck.rst (Diff revision 5)

The issue has been resolved. Show all issues

This reads a little funky. How about "Improper credentials can include things such as AWS keys hardcoded in source or private key files."

Commit:

d5e3af9386608400281a3eb40045da461682af37

120e7993fdca837f1c8015d2c8f0cb51ccf9cc1d

Diff:

Revision 6 (+111)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 6)
The issue has been resolved. Show all issues
```
E124 closing bracket does not match visual indentation
```

Diff:

Revision 7 (+111)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 7)
The issue has been resolved. Show all issues
```
E124 closing bracket does not match visual indentation
```

Commit:

120e7993fdca837f1c8015d2c8f0cb51ccf9cc1d

fec83230768108490c64c22505158cf99e7db9b6

Diff:

Revision 8 (+112)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

README.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```
bot/README.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

These regexes won't work in a few cases:

Shell scripts with AWS_SECRET_KEY=... i.e., without quotes.
Single-quoted string values

We can get around this with:

{
    'AWS_KEY': br'''(?:AWS_KEY|AWS_ACCESS_KEY|AWS_ACCESS_KEY_ID)\s*=\s*(?P<quote>["']?)[A-Z0-9]{20}(?P=quote)'''
}

bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Single quotes.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Single quotes around AWS_SECRET_KEY
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

Trailing space, but it should also have a period.

Also, comma after e.g..

ammar Oct. 19, 2018, 4:05 p.m.

Also, comma after e.g..

I am not sure what comma you mean by this.

david Oct. 30, 2018, 4:30 p.m.
```
"e.g., pem, key, id_rsa"
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Missing args/kwargs
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

https://docs.python.org/2/library/os.path.html#os.path.splitext

ammar Oct. 19, 2018, 4:05 p.m.

I think current approach seems to end up working nicer.
With spiltext, we need to maintain two sets instead of one (file names and file extensions) and then check for a given file if the file name is in file names set or file extension in file extensions set.

credential_file_names = {
    '.aws_credentials'
    'id_dsa',
    'id_ecdsa',
    'id_rsa',
}

credential_file_types = {
    '.key',
    '.p12',
    '.pem',
    '.ppk',
}

So we would have to keep these sets and then check:

        if (f.file_type in unsafe_file_types or
            f.filename in unsafe_file_types):

where unsafe_file_types = credential_file_types + additional file types specified in options. Options just takes in one list currently A comma-separated list of file names and extensions, and from this list I don't see a way to split items into file types or extensions (both could start with a '.'). So we would have to create a second field in options.

Is there a problem with keeping on using split?

david Oct. 30, 2018, 4:30 p.m.

What you have is fine, but please add a comment explaining the reasoning.

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

.iteritems() is Python2 only. You'll want to do:

import six 

# ...

for name, pattern in six.iteritems(self.compiled_re):

bot/setup.py (Diff revision 8)

The issue has been resolved. Show all issues

This should line up with the string above, e.g.

('... '
 '... '),

bot/setup.py (Diff revision 8)

The issue has been resolved. Show all issues

RBTools depends on six but if we're using it directly, we should have our own dependency on it. (I mention using six in another comment).

docs/reviewbot/tools/credentialscheck.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Trailing whitespace.
```
docs/reviewbot/tools/index.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```
extension/README.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Single quotes here
```

Commit:

fec83230768108490c64c22505158cf99e7db9b6

9eb1b74f700678b4b76838bcc7a0fe2ff3a10727

Diff:

Revision 9 (+117)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/credentials_check.py (Diff revision 9)

The issue has been resolved. Show all issues

six is a third-party library, so it should go in it's own "section":

import re

import six

from reviewbot.tools import Tool

bot/reviewbot/tools/credentials_check.py (Diff revision 9)

The issue has been resolved. Show all issues

Formatting here could be a little nicer. If you put the parens on their own lines, then the strings will line up a bit nicer. We also don't need to use the triple quotes (better to just escape any inner ' characters), but we do need to have the br prefix on each line, even though they get concatenated.

'AWS_KEY': (
    br'...'
    br'...'
),
'AWS_SECRET_KEY': (
    br'...'
    br'...'
),

bot/reviewbot/tools/credentials_check.py (Diff revision 9)

The issue has been resolved. Show all issues

This should use six.iteritems. Also, this can use a dict comprehension to do it all in one go:

super(CredentialsCheckTool, self).__init__()

self.compiled_re = {
    name: re.compile(pattern)
    for name, pattern in six.iteritems(credential_patterns)
}

ammar Oct. 30, 2018, 6:16 p.m.

Thanks! I did not know about dictionary comprehensions.

bot/setup.py (Diff revision 9)

The issue has been resolved. Show all issues

In this case I think it's probably better to just ignore the line length warning.

docs/reviewbot/tools/credentialscheck.rst (Diff revision 9)
The issue has been resolved. Show all issues
```
ReviewBot -> Review Bot
```

Commit:

9eb1b74f700678b4b76838bcc7a0fe2ff3a10727

7cebf8d3cfee0dc7b7a5aa8dea54c2bac753eda6

Diff:

Revision 10 (+123)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 10)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

bot/reviewbot/tools/credentials_check.py (Diff revision 10)
The issue has been resolved. Show all issues
```
ReviewBot -> Review Bot
```
bot/setup.py (Diff revision 10)
The issue has been resolved. Show all issues
```
Should be in alphabetical order?
```

Commit:

7cebf8d3cfee0dc7b7a5aa8dea54c2bac753eda6

544b14a372d66e599399149480919555ce0004d0

Diff:

Revision 11 (+123)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 11)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

bot/reviewbot/tools/credentials_check.py (Diff revision 11)

The issue has been resolved. Show all issues

Will this fit as:

            for risk_name, pattern in six.iteritems(
                self._compiled_re):
                # ...

Branch:

release-1.0.x

master

Commit:

544b14a372d66e599399149480919555ce0004d0

554f3e838d5a44738f97469fe144da2ea6d821ad

Diff:

Revision 12 (+122)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 12)
The issue has been resolved. Show all issues
```
F821 undefined name 'compiled_pattern'
```
bot/setup.py (Diff revision 12)
The issue has been resolved. Show all issues
```
E501 line too long (88 > 79 characters)
```

Commit:

554f3e838d5a44738f97469fe144da2ea6d821ad

4047beed2fa56a129d6e9a9f39d6d687573577f4

Diff:

Revision 13 (+122)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 13)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

Ship it!

```
Ship It!
```

```
Two tiny nits:
```

The issue has been resolved. Show all issues

In your change description: "ReviewBot" -> "Review Bot" (3x)

docs/reviewbot/tools/credentialscheck.rst (Diff revision 13)
The issue has been resolved. Show all issues
```
Add another blank line here.
```

Description:

~		Previously, ReviewBot did not check for credentials that may have been
	~	Previously, Review Bot did not check for credentials that may have been
		accidentally included in the commit. A human reviewer would have to
		look out for them, but we hoped to move more of this task's burden
~		to ReviewBot.
	~	to Review Bot.

~		A new Credentials Check tool has been added to ReviewBot which
	~	A new Credentials Check tool has been added to Review Bot which
		looks for various key files, other sensitive files and inline embedded
		AWS credentials to make sure these are not pushed to the repository.

Commit:

4047beed2fa56a129d6e9a9f39d6d687573577f4

d332e509b24207efeaa0b32781ce14c05acd4d08

Diff:

Revision 14 (+123)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 14)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

Status:: Discarded