This change has been discarded.

Summary

Add credentials check tool

Review Request #10223 — Created Oct. 11, 2018 and discarded 4 years, 7 months ago

Information

Owner

ammar*

Repository

ReviewBot

Branch

master

Bugs

Depends On

Reviewers

Groups

reviewbot, students

People

Description*

Previously, Review Bot did not check for credentials that may have been
accidentally included in the commit. A human reviewer would have to
look out for them, but we hoped to move more of this task's burden
to Review Bot.

A new Credentials Check tool has been added to Review Bot which
looks for various key files, other sensitive files and inline embedded
AWS credentials to make sure these are not pushed to the repository.

Testing Done

Manual tests (correctly finds and creates issues on lines with the
credentials, or marks the first line of a file type (including
file types specified from options tab) that should not have been
included e.g. .pem files).

Issues

Description	From	Last Updated
Can you wrap your description and testing done at 72 Chars?	brennie	6 years, 8 months ago
Is this a WIP? Your description has "WIP: Add options..." If this is WIP please put it in the summary	brennie	6 years, 8 months ago
In your change description: "ReviewBot" -> "Review Bot" (3x)	david	6 years, 7 months ago
E501 line too long (83 > 79 characters)	reviewbot	6 years, 9 months ago
E501 line too long (84 > 79 characters)	reviewbot	6 years, 9 months ago
E501 line too long (85 > 79 characters)	reviewbot	6 years, 9 months ago
E501 line too long (80 > 79 characters)	reviewbot	6 years, 9 months ago
E501 line too long (84 > 79 characters)	reviewbot	6 years, 9 months ago
E501 line too long (89 > 79 characters)	reviewbot	6 years, 9 months ago
typo: "credntialscheck"	brennie	6 years, 9 months ago
typo: "credntialscheck"	brennie	6 years, 9 months ago
Missing module-level docstring	brennie	6 years, 9 months ago
Module imports should be formatted as: from __future__ import ... # Python STDLib imports # 3rd party imports # Imports …	brennie	6 years, 9 months ago
Instead of having multiple credential regexes, we can make this into a single regular expression: compiled_credential_pattern = re.compile( '\|'.join( '(%s)' …	brennie	6 years, 9 months ago
Single quotes here and throughout	brennie	6 years, 9 months ago
Missing trailing comma. This regex doesn't do what you want it to because of the leading [, which makes everything …	brennie	6 years, 9 months ago
Instead of doing this here, we should do it in CredentialsCheckTool.__init__ so that they're not sitting here taking up memory …	brennie	6 years, 9 months ago
Docstrings should be of the form: """Single line summary. Multi-line description. """	brennie	6 years, 9 months ago
We use the Oxford comma, so there should be a comma after private keys.	brennie	6 years, 9 months ago
How about just "Review a single file."?	brennie	6 years, 9 months ago
Blank line between these.	brennie	6 years, 9 months ago
This will not detect files named id_rsa because it is not an extension. You will need a separate set of …	brennie	6 years, 9 months ago
We should word this as "may be a security risk" because if it's a public key (PEM files can be …	brennie	6 years, 9 months ago
Comments should be complete sentences: they should begin with a capital letter and end with a period.	brennie	6 years, 9 months ago
What exceptions are we hoping to catch? We should be very specific about what we expect so that other exceptions …	brennie	6 years, 9 months ago
Blank line between these.	brennie	6 years, 9 months ago
Since pattern is a compiled regular expression, you can just do pattern.match(line)	brennie	6 years, 9 months ago
You're going to need to add an entry point for your tool.	brennie	6 years, 9 months ago
:file:`.pem` :file:`id_rsa`	brennie	6 years, 9 months ago
How about: This tool is built into ReviewBot. There is no separate installation step required.	brennie	6 years, 9 months ago
Typo: "credntialscheck"	brennie	6 years, 9 months ago
One more blank line here.	brennie	6 years, 8 months ago
Can you add a docstring here?	brennie	6 years, 8 months ago
No blank line here.	brennie	6 years, 8 months ago
Can you wrap these in parens to make it clear that this is supposed to be multiple lines? e.g. python …	brennie	6 years, 8 months ago
E128 continuation line under-indented for visual indent	reviewbot	6 years, 8 months ago
It looks like these wouldn't catch cases where the value was enclosed in quotes?	david	6 years, 8 months ago
Let's put one per line and sort them all alphabetically.	david	6 years, 8 months ago
Should probably be "Including this file ..." (files -> file)	david	6 years, 8 months ago
We should be able to use regex matching against bytestrings, so we can skip the detection/decoding here. We just need …	david	6 years, 8 months ago
Do we want to use .search() instead of .match()?	david	6 years, 8 months ago
I feel like we should be more verbose about what the problem might be (for example, "Potential disclosure of private …	david	6 years, 8 months ago
A link to this needs to be added to docs/reviewbot/tools/index.rst	david	6 years, 8 months ago
This reads a little funky. How about "Improper credentials can include things such as AWS keys hardcoded in source or …	david	6 years, 8 months ago
E124 closing bracket does not match visual indentation	reviewbot	6 years, 8 months ago
E124 closing bracket does not match visual indentation	reviewbot	6 years, 8 months ago
Can you insert this in alphabetical order?	brennie	6 years, 8 months ago
Can you insert this in alphabetical order?	brennie	6 years, 8 months ago
These regexes won't work in a few cases: Shell scripts with AWS_SECRET_KEY=... i.e., without quotes. Single-quoted string values We can …	brennie	6 years, 8 months ago
Single quotes.	brennie	6 years, 8 months ago
Single quotes around AWS_SECRET_KEY	brennie	6 years, 8 months ago
Single quotes here	alextechcc	6 years, 8 months ago
Trailing space, but it should also have a period. Also, comma after e.g..	brennie	6 years, 8 months ago
Missing args/kwargs	brennie	6 years, 8 months ago
https://docs.python.org/2/library/os.path.html#os.path.splitext	brennie	6 years, 8 months ago
.iteritems() is Python2 only. You'll want to do: import six # ... for name, pattern in six.iteritems(self.compiled_re):	brennie	6 years, 8 months ago
This should line up with the string above, e.g. ('... ' '... '),	brennie	6 years, 8 months ago
RBTools depends on six but if we're using it directly, we should have our own dependency on it. (I mention …	brennie	6 years, 8 months ago
Trailing whitespace.	brennie	6 years, 8 months ago
Can you insert this in alphabetical order?	brennie	6 years, 8 months ago
Can you insert this in alphabetical order?	brennie	6 years, 8 months ago
six is a third-party library, so it should go in its own "section": import re import six from reviewbot.tools import …	david	6 years, 8 months ago
Formatting here could be a little nicer. If you put the parens on their own lines, then the strings will …	david	6 years, 8 months ago
This should use six.iteritems. Also, this can use a dict comprehension to do it all in one go: super(CredentialsCheckTool, self).__init__() …	david	6 years, 8 months ago
In this case I think it's probably better to just ignore the line length warning.	david	6 years, 8 months ago
ReviewBot -> Review Bot	david	6 years, 8 months ago
ReviewBot -> Review Bot	ilaw	6 years, 8 months ago
E501 line too long (88 > 79 characters)	reviewbot	6 years, 8 months ago
Should be in alphabetical order?	ilaw	6 years, 8 months ago
Will this fit as: for risk_name, pattern in six.iteritems( self._compiled_re): # ...	brennie	6 years, 8 months ago
E501 line too long (88 > 79 characters)	reviewbot	6 years, 8 months ago
F821 undefined name 'compiled_pattern'	reviewbot	6 years, 8 months ago
E501 line too long (88 > 79 characters)	reviewbot	6 years, 8 months ago
E501 line too long (88 > 79 characters)	reviewbot	6 years, 8 months ago
Add another blank line here.	david	6 years, 7 months ago
E501 line too long (88 > 79 characters)	reviewbot	6 years, 7 months ago
There are no open issues

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (83 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (84 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (85 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (80 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (84 > 79 characters)
```
bot/reviewbot/tools/credentials_check.py (Diff revision 1)
The issue has been resolved. Show all issues
```
E501 line too long (89 > 79 characters)
```

Commit:

ac44d7ecf125fc206d0459a16ce2b3c81d7bf6a7

b872485315ddf3a609b1cd467fc1e958a24586a4

Diff:

Revision 2 (+86)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

README.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
typo: "credntialscheck"
```
bot/README.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
typo: "credntialscheck"
```
bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Missing module-level docstring
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Module imports should be formatted as:

from __future__ import ...

# Python STDLib imports

# 3rd party imports

# Imports from this package


e.g.

from __future__ import unicode_literals

import re

import chardet

from reviewbot.tools import Tool

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Instead of having multiple credential regexes, we can make this into a single regular expression:

compiled_credential_pattern = re.compile(
    '|'.join(
        '(%s)' % pattern
        for pattern in credential_patterns
    )
)
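
A minimal sketch of the combined-pattern idea above, for readers who want to try it. The two entries in `credential_patterns` are simplified placeholders rather than the patterns from the change under review, and `line_has_credential` is a hypothetical helper name.

```python
import re

# Hypothetical, simplified patterns for illustration only.
credential_patterns = [
    r'AWS_SECRET_KEY\s*=\s*[A-Za-z0-9/+=]{40}',
    r'AWS_ACCESS_KEY_ID\s*=\s*[A-Z0-9]{20}',
]

# Wrap each alternative in a group so that '|' cannot bind across patterns.
compiled_credential_pattern = re.compile(
    '|'.join(
        '(%s)' % pattern
        for pattern in credential_patterns
    )
)


def line_has_credential(line):
    """Return whether any of the combined patterns occurs in the line."""
    return compiled_credential_pattern.search(line) is not None
```

One trade-off of a single combined expression is that the caller no longer knows directly which pattern matched, which matters if each pattern should produce its own message.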

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Single quotes here and throughout
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Missing trailing comma.
This regex doesn't do what you want it to because of the leading [, which turns everything up to the first ] into a character class that this regex will match.
e.g.

>>> import re
>>> x = re.compile(r"[AWS_SECRET_KEY\s*=\s*[A-Za-z0-9/+=]{40}")
>>> m = x.match('SSSSSSSSSSSS  = %s' % ('A' * 40))
>>> bool(m)
True
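
The same point as a side-by-side sketch. The `fixed` pattern below is only an assumption about what the intended expression looks like with the stray bracket removed; it is not taken from the diff.

```python
import re

# The stray '[' opens a character class that runs to the first ']', so the
# pattern means "40 characters drawn from that set", not a key assignment.
broken = re.compile(r'[AWS_SECRET_KEY\s*=\s*[A-Za-z0-9/+=]{40}')

# Assumed intent: the literal name, '=', then a 40-character value.
fixed = re.compile(r'AWS_SECRET_KEY\s*=\s*[A-Za-z0-9/+=]{40}')

not_a_key = 'S' * 40
real_assignment = 'AWS_SECRET_KEY = ' + 'A' * 40

# The broken pattern flags text that contains no key assignment at all.
false_positive = broken.match(not_a_key) is not None
true_positive = fixed.match(real_assignment) is not None
```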

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Instead of doing this here, we should do it in CredentialsCheckTool.__init__ so that they're not sitting here taking up memory if the tool isn't used, e.g.

class CredentialsCheckTool(Tool):
    def __init__(self, *args, **kwargs):
        super(CredentialsCheckTool, self).__init__(*args, **kwargs)
        self.compiled_re = [
            re.compile(regex)
            for regex in credential_patterns
        ]

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Docstrings should be of the form:

"""Single line summary.

Multi-line description.
"""

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
We use the Oxford comma, so there should be a comma after private keys.
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been dropped. Show all issues

How about just "Review a single file."?

ammar 6 years, 9 months ago

All tools use the "Perform a review of a single file" verbiage. Do we still want to change this?

brennie 6 years, 9 months ago
```
Nope, that's fine!
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Blank line between these.
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

This will not detect files named id_rsa because it is not an extension. You will need a separate set of file names (id_rsa, id_dsa, id_ecdsa) versus file extensions (p12, pem, ppk, key).

ammar 6 years, 9 months ago


>>> file_type = "id_rsa".lower().split(".")[-1]
>>> file_type
'id_rsa'
>>> file_type = "my.key".lower().split(".")[-1]
>>> file_type
'key'


file_type actually captures both

brennie 6 years, 9 months ago

Ok ignore this then :)

BTW you can use os.path.splitext
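
To make the difference between the two approaches concrete, here is a small sketch. The helper names `last_dot_component` and `extension` are made up for illustration and are not part of the change under review.

```python
import os.path


def last_dot_component(filename):
    """Mirror the split-based approach shown in the reply above."""
    return filename.lower().split('.')[-1]


def extension(filename):
    """Return the extension reported by os.path.splitext (may be empty)."""
    return os.path.splitext(filename.lower())[1]


# A bare file name has no extension, so splitext reports an empty string,
# while the split-based approach returns the whole name.
bare_name = (last_dot_component('id_rsa'), extension('id_rsa'))

# For a regular extension, the two differ only by the leading dot.
with_extension = (last_dot_component('my.key'), extension('my.key'))
```

With `splitext`, file names and extensions would need to be checked separately, whereas the split-based value can be compared against a single set.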

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

We should word this as "may be a security risk" because if it's a public key (PEM files can be encoded public keys), it totally isn't.

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

Comments should be complete sentences: they should begin with a capital letter and end with a period.

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

I would organize this as:

try:
    encoding = chardet.detect(
        f.patched_file_contents)['encoding']

    if encoding:
        contents = f.patched_file_contents.decode(
            encoding, 'strict')
    else:
        # We can't do any more for this file.
        return
except (TypeError, UnicodeError, ValueError):
    return

lines = contents.split('\n')

for line_number, line in enumerate(lines, 1):
    for pattern in compiled_credential_patterns:
        if pattern.match(line):
            f.comment(...)


This lets us use explicit control flow (return) when we are done instead of keeping track of boolean state.

ammar 6 years, 9 months ago
```
Thanks! This is much cleaner.
```

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

Does bytes.decode(encoding, 'strict') work in Python 3?

ammar 6 years, 9 months ago

Seems to work fine.

Python 3.5.2 (default, Nov 23 2017, 16:37:01) 
[GCC 5.4.0 20160609] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> b'100000000'.decode("ascii", "strict")
'100000000'

bot/reviewbot/tools/credentials_check.py (Diff revision 2)

The issue has been resolved. Show all issues

What exceptions are we hoping to catch? We should be very specific about what we expect so that other exceptions that we aren't expecting aren't also captured.

Looking at the source of chardet.detect, it looks like it only raises TypeError. bytes.decode(..., 'strict') seems to raise ValueError and UnicodeError.
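
As a quick check of the decoding side of this, the sketch below shows how the built-in exceptions relate. `try_decode` is a hypothetical helper, not code from the diff.

```python
def try_decode(data, encoding):
    """Decode bytes strictly, returning None if the bytes are not valid."""
    try:
        return data.decode(encoding, 'strict')
    except UnicodeError:
        # UnicodeDecodeError is a subclass of UnicodeError.
        return None


decoded = try_decode(b'abc', 'ascii')
rejected = try_decode(b'\xff', 'ascii')

# UnicodeError is itself a subclass of ValueError, so catching ValueError
# alone would also cover strict decoding failures.
hierarchy_holds = (issubclass(UnicodeDecodeError, UnicodeError) and
                   issubclass(UnicodeError, ValueError))
```

Note that an unknown encoding name raises `LookupError`, which none of the exceptions discussed above would catch.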

bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Blank line between these.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 2)
The issue has been resolved. Show all issues
```
Since pattern is a compiled regular expression, you can just do pattern.match(line)
```
bot/setup.py (Diff revision 2)
The issue has been resolved. Show all issues
```
You're going to need to add an entry point for your tool.
```
docs/reviewbot/tools/credentialscheck.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
:file:`.pem`
:file:`id_rsa`
```

docs/reviewbot/tools/credentialscheck.rst (Diff revision 2)

The issue has been resolved. Show all issues

How about:

This tool is built into ReviewBot. There is no separate installation step required.

extension/README.rst (Diff revision 2)
The issue has been resolved. Show all issues
```
Typo: "credntialscheck"
```

Commit:

b872485315ddf3a609b1cd467fc1e958a24586a4

b988408dc3c6a168c18e64b00c89901d5c612fa7

Diff:

Revision 3 (+101)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

The issue has been resolved. Show all issues

Can you wrap your description and testing done at 72 Chars?

The issue has been resolved. Show all issues

Is this a WIP? Your description has "WIP: Add options..."

If this is WIP please put it in the summary

bot/reviewbot/tools/credentials_check.py (Diff revision 3)
The issue has been resolved. Show all issues
```
One more blank line here.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 3)
The issue has been resolved. Show all issues
```
Can you add a docstring here?
```
bot/reviewbot/tools/credentials_check.py (Diff revision 3)
The issue has been resolved. Show all issues
```
No blank line here.
```

bot/setup.py (Diff revision 3)

The issue has been resolved. Show all issues

Can you wrap these in parens to make it clear that this is supposed to be multiple lines?

e.g.

('credentialscheck = '
 '...'),

This will help in not accidentally adding a comma there in the future.
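
A small sketch of the hazard being described, assuming a list of entry point strings. The tool names and module paths are placeholders, not the real values from bot/setup.py.

```python
# Without parentheses, a string split over two lines looks like two list
# entries, which invites someone to "fix" it by adding a comma.
entry_points_unwrapped = [
    'credentialscheck = '
    'example.tools.credentials_check:CredentialsCheckTool',
    'othertool = example.tools.other:OtherTool',
]

# Parentheses make it visible that the two pieces form a single string.
entry_points_wrapped = [
    ('credentialscheck = '
     'example.tools.credentials_check:CredentialsCheckTool'),
    'othertool = example.tools.other:OtherTool',
]

# The accidental comma silently turns one entry into two broken ones.
entry_points_with_stray_comma = [
    'credentialscheck = ',
    'example.tools.credentials_check:CredentialsCheckTool',
    'othertool = example.tools.other:OtherTool',
]
```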

Description:

~		Previously, ReviewBot did not check for credentials that may have been accidentally included in the commit. A human reviewer would have to look out for them, but we hoped to move more of this task's burden to ReviewBot.
	~	Previously, ReviewBot did not check for credentials that may have been
	+	accidentally included in the commit. A human reviewer would have to
	+	look out for them, but we hoped to move more of this task's burden
	+	to ReviewBot.

~		A new Credentials Check tool has been added to ReviewBot which looks for various key files, other sensitive files and inline embedded AWS credentials to make sure these are not pushed to the repository.
~
~		WIP: Add options to let users enter their own files to ignore.
	~	A new Credentials Check tool has been added to ReviewBot which
	~	looks for various key files, other sensitive files and inline embedded
	~	AWS credentials to make sure these are not pushed to the repository.

Testing Done:

~		Manual tests (correctly finds and creates issues on lines with the credentials, or marks the first line of a file type that should not have been included e.g. .pem files)
	~	Manual tests (correctly finds and creates issues on lines with the
	+	credentials, or marks the first line of a file type (including
	+	file types specified from options tab) that should not have been
	+	included e.g. .pem files).

Commit:

b988408dc3c6a168c18e64b00c89901d5c612fa7

ac317d290bcc069e970f67365e879db2e3affba9

Diff:

Revision 4 (+120)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 4)
The issue has been resolved. Show all issues
```
E128 continuation line under-indented for visual indent
```

Commit:

ac317d290bcc069e970f67365e879db2e3affba9

d5e3af9386608400281a3eb40045da461682af37

Diff:

Revision 5 (+120)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
It looks like these wouldn't catch cases where the value was enclosed in quotes?
```
bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Let's put one per line and sort them all alphabetically.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Should probably be "Including this file ..." (files -> file)
```

bot/reviewbot/tools/credentials_check.py (Diff revision 5)

The issue has been resolved. Show all issues

We should be able to use regex matching against bytestrings, so we can skip the detection/decoding here. We just need to make sure that the patterns use br'...'

ammar 6 years, 8 months ago

What does br mean/achieve? Cannot seem to find any documentation on it.
Also, if we skip decoding, how can we mitigate risk of running the regex against image files etc?

david 6 years, 8 months ago

An r prefix means "raw string", where a \ character is actually a \ (and doesn't need to be escaped with \\). That's generally useful for regexes that use a lot of \ characters. A b prefix means a bytestring (as opposed to unicode text).

As far as binary files and such, we currently don't worry about them (because of the way we store diffs). Even when we add more significant binary file support, I don't think you will need to care here.
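
For anyone else wondering about the prefixes, a short sketch under Python 3. The pattern is a simplified stand-in, not the one in the diff.

```python
import re

# b = bytes literal, r = raw literal (a backslash stays a backslash).
pattern = re.compile(br'AWS_SECRET_KEY\s*=\s*[A-Za-z0-9/+=]{40}')

# A bytes pattern can be applied directly to undecoded file contents.
line = b'AWS_SECRET_KEY = ' + b'A' * 40
found = pattern.search(line) is not None

# A bytes pattern cannot be applied to text; Python 3 raises TypeError.
try:
    pattern.search('AWS_SECRET_KEY = ' + 'A' * 40)
    mixing_allowed = True
except TypeError:
    mixing_allowed = False

# In a raw literal, '\s' is two characters: a backslash and an 's'.
raw_is_literal_backslash = (r'\s' == '\\s')
```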

bot/reviewbot/tools/credentials_check.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Do we want to use .search() instead of .match()?
```
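
The difference matters for lines where the credential is not at the very start. A minimal sketch, using a made-up pattern and input line:

```python
import re

pattern = re.compile(r'AWS_SECRET_KEY\s*=')

line = '    export AWS_SECRET_KEY=abc'

# match() only succeeds if the pattern matches at the start of the string.
matched_at_start = pattern.match(line) is not None

# search() scans the whole string for the first location that matches.
found_anywhere = pattern.search(line) is not None
```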

bot/reviewbot/tools/credentials_check.py (Diff revision 5)

The issue has been resolved. Show all issues

I feel like we should be more verbose about what the problem might be (for example, "Potential disclosure of private Amazon AWS keys")

docs/reviewbot/tools/credentialscheck.rst (Diff revision 5)
The issue has been resolved. Show all issues
```
A link to this needs to be added to docs/reviewbot/tools/index.rst
```

docs/reviewbot/tools/credentialscheck.rst (Diff revision 5)

The issue has been resolved. Show all issues

This reads a little funky. How about "Improper credentials can include things such as AWS keys hardcoded in source or private key files."

Commit:

d5e3af9386608400281a3eb40045da461682af37

120e7993fdca837f1c8015d2c8f0cb51ccf9cc1d

Diff:

Revision 6 (+111)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 6)
The issue has been resolved. Show all issues
```
E124 closing bracket does not match visual indentation
```

Diff:

Revision 7 (+111)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 7)
The issue has been resolved. Show all issues
```
E124 closing bracket does not match visual indentation
```

Commit:

120e7993fdca837f1c8015d2c8f0cb51ccf9cc1d

fec83230768108490c64c22505158cf99e7db9b6

Diff:

Revision 8 (+112)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

README.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```
bot/README.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

These regexes won't work in a few cases:

Shell scripts with AWS_SECRET_KEY=... i.e., without quotes.
Single-quoted string values

We can get around this with:

{
    'AWS_KEY': br'''(?:AWS_KEY|AWS_ACCESS_KEY|AWS_ACCESS_KEY_ID)\s*=\s*(?P<quote>["']?)[A-Z0-9]{20}(?P=quote)'''
}
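
A sketch exercising the suggested pattern against the cases listed above. The test key is a dummy value of the right shape, and the pattern is split across two literals only to keep the lines short.

```python
import re

aws_key = re.compile(
    br'''(?:AWS_KEY|AWS_ACCESS_KEY|AWS_ACCESS_KEY_ID)\s*=\s*'''
    br'''(?P<quote>["']?)[A-Z0-9]{20}(?P=quote)'''
)

key = b'A' * 20  # Dummy value, not a real key.

# (?P<quote>...) captures an optional opening quote and (?P=quote) requires
# the same character (or nothing) after the value.
unquoted = aws_key.search(b'AWS_KEY=' + key) is not None
double_quoted = aws_key.search(b'AWS_ACCESS_KEY = "' + key + b'"') is not None
single_quoted = aws_key.search(b"AWS_ACCESS_KEY_ID='" + key + b"'") is not None

# Mismatched quotes do not satisfy the backreference.
mismatched = aws_key.search(b'AWS_KEY = "' + key + b"'") is not None
```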

bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Single quotes.
```
bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Single quotes around AWS_SECRET_KEY
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

Trailing space, but it should also have a period.

Also, comma after e.g..

ammar 6 years, 8 months ago

Also, comma after e.g..

I am not sure what comma you mean by this.

david 6 years, 8 months ago
```
"e.g., pem, key, id_rsa"
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Missing args/kwargs
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

https://docs.python.org/2/library/os.path.html#os.path.splitext

ammar 6 years, 8 months ago

I think the current approach seems to end up working nicer.
With splitext, we need to maintain two sets instead of one (file names and file extensions) and then check for a given file if the file name is in the file names set or the file extension is in the file extensions set.

credential_file_names = {
    '.aws_credentials',
    'id_dsa',
    'id_ecdsa',
    'id_rsa',
}

credential_file_types = {
    '.key',
    '.p12',
    '.pem',
    '.ppk',
}

So we would have to keep these sets and then check:

        if (f.file_type in unsafe_file_types or
            f.filename in unsafe_file_types):

where unsafe_file_types = credential_file_types + additional file types specified in options. Options currently takes just one comma-separated list of file names and extensions, and from this list I don't see a way to tell file names apart from extensions (both could start with a '.'). So we would have to create a second field in options.

Is there a problem with keeping on using split?

david 6 years, 8 months ago

What you have is fine, but please add a comment explaining the reasoning.

bot/reviewbot/tools/credentials_check.py (Diff revision 8)

The issue has been resolved. Show all issues

.iteritems() is Python2 only. You'll want to do:

import six 

# ...

for name, pattern in six.iteritems(self.compiled_re):

bot/setup.py (Diff revision 8)

The issue has been resolved. Show all issues

This should line up with the string above, e.g.

('... '
 '... '),

bot/setup.py (Diff revision 8)

The issue has been resolved. Show all issues

RBTools depends on six but if we're using it directly, we should have our own dependency on it. (I mention using six in another comment).

docs/reviewbot/tools/credentialscheck.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Trailing whitespace.
```
docs/reviewbot/tools/index.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```
extension/README.rst (Diff revision 8)
The issue has been resolved. Show all issues
```
Can you insert this in alphabetical order?
```

bot/reviewbot/tools/credentials_check.py (Diff revision 8)
The issue has been resolved. Show all issues
```
Single quotes here
```

Commit:

fec83230768108490c64c22505158cf99e7db9b6

9eb1b74f700678b4b76838bcc7a0fe2ff3a10727

Diff:

Revision 9 (+117)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/credentials_check.py (Diff revision 9)

The issue has been resolved. Show all issues

six is a third-party library, so it should go in its own "section":

import re

import six

from reviewbot.tools import Tool

bot/reviewbot/tools/credentials_check.py (Diff revision 9)

The issue has been resolved. Show all issues

Formatting here could be a little nicer. If you put the parens on their own lines, then the strings will line up a bit nicer. We also don't need to use the triple quotes (better to just escape any inner ' characters), but we do need to have the br prefix on each line, even though they get concatenated.

'AWS_KEY': (
    br'...'
    br'...'
),
'AWS_SECRET_KEY': (
    br'...'
    br'...'
),
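
To illustrate why every piece needs the prefix, here is a sketch under Python 3. The pattern text is a simplified placeholder, since the real strings are elided above.

```python
import re

# Adjacent literals are joined into one value, so each piece must be the
# same kind of literal.
aws_secret = (
    br'AWS_SECRET_KEY\s*=\s*'
    br'[A-Za-z0-9/+=]{40}'
)
joined_is_bytes = isinstance(aws_secret, bytes)

# Mixing a bytes literal with a text literal is rejected when the source
# is compiled, before any code runs.
try:
    compile("x = (br'a' r'b')", '<example>', 'exec')
    mixing_compiles = True
except SyntaxError:
    mixing_compiles = False

matches = re.compile(aws_secret).search(
    b'AWS_SECRET_KEY=' + b'A' * 40) is not None
```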

bot/reviewbot/tools/credentials_check.py (Diff revision 9)

The issue has been resolved. Show all issues

This should use six.iteritems. Also, this can use a dict comprehension to do it all in one go:

super(CredentialsCheckTool, self).__init__()

self.compiled_re = {
    name: re.compile(pattern)
    for name, pattern in six.iteritems(credential_patterns)
}

ammar 6 years, 8 months ago

Thanks! I did not know about dictionary comprehensions.

bot/setup.py (Diff revision 9)

The issue has been resolved. Show all issues

In this case I think it's probably better to just ignore the line length warning.

docs/reviewbot/tools/credentialscheck.rst (Diff revision 9)
The issue has been resolved. Show all issues
```
ReviewBot -> Review Bot
```

Commit:

9eb1b74f700678b4b76838bcc7a0fe2ff3a10727

7cebf8d3cfee0dc7b7a5aa8dea54c2bac753eda6

Diff:

Revision 10 (+123)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 10)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

bot/reviewbot/tools/credentials_check.py (Diff revision 10)
The issue has been resolved. Show all issues
```
ReviewBot -> Review Bot
```
bot/setup.py (Diff revision 10)
The issue has been resolved. Show all issues
```
Should be in alphabetical order?
```

Commit:

7cebf8d3cfee0dc7b7a5aa8dea54c2bac753eda6

544b14a372d66e599399149480919555ce0004d0

Diff:

Revision 11 (+123)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 11)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

bot/reviewbot/tools/credentials_check.py (Diff revision 11)

The issue has been resolved. Show all issues

Will this fit as:

            for risk_name, pattern in six.iteritems(
                self._compiled_re):
                # ...

Branch:

release-1.0.x

master

Commit:

544b14a372d66e599399149480919555ce0004d0

554f3e838d5a44738f97469fe144da2ea6d821ad

Diff:

Revision 12 (+122)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/reviewbot/tools/credentials_check.py (Diff revision 12)
The issue has been resolved. Show all issues
```
F821 undefined name 'compiled_pattern'
```
bot/setup.py (Diff revision 12)
The issue has been resolved. Show all issues
```
E501 line too long (88 > 79 characters)
```

Commit:

554f3e838d5a44738f97469fe144da2ea6d821ad

4047beed2fa56a129d6e9a9f39d6d687573577f4

Diff:

Revision 13 (+122)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 13)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

Ship it!

```
Ship It!
```

```
Two tiny nits:
```

The issue has been resolved. Show all issues

In your change description: "ReviewBot" -> "Review Bot" (3x)

docs/reviewbot/tools/credentialscheck.rst (Diff revision 13)
The issue has been resolved. Show all issues
```
Add another blank line here.
```

Description:

~		Previously, ReviewBot did not check for credentials that may have been
	~	Previously, Review Bot did not check for credentials that may have been
		accidentally included in the commit. A human reviewer would have to
		look out for them, but we hoped to move more of this task's burden
~		to ReviewBot.
	~	to Review Bot.

~		A new Credentials Check tool has been added to ReviewBot which
	~	A new Credentials Check tool has been added to Review Bot which
		looks for various key files, other sensitive files and inline embedded
		AWS credentials to make sure these are not pushed to the repository.

Commit:

4047beed2fa56a129d6e9a9f39d6d687573577f4

d332e509b24207efeaa0b32781ce14c05acd4d08

Diff:

Revision 14 (+123)

Show changes

	README.rst
	bot/README.rst
	bot/setup.py
	bot/reviewbot/tools/credentials_check.py
	docs/reviewbot/tools/credentialscheck.rst
	docs/reviewbot/tools/index.rst
	extension/README.rst

Checks run (1 failed, 1 succeeded)

flake8 failed.

JSHint passed.

flake8

bot/setup.py (Diff revision 14)
The issue has been dropped. Show all issues
```
E501 line too long (88 > 79 characters)
```

Status: Discarded