Summary

Go Tool for Review Bot

Review Request #11240 — Created Oct. 22, 2020 and submitted Jan. 4, 2021, 1:01 p.m.

Information

Owner

jblazusi

Repository

ReviewBot

Branch

master

Bugs

Depends On

Commit

61c94b0...

Reviewers

Groups

reviewbot, students

People

ceciliawei

Description

This tool's ability to format was removed and placed into another tool, tentatively known as GofmtTool. This was done because go fmt is rather lightweight and does not rely on other files to carry out its function. However, go test and go vet require more information in order to work consistently. In particular, they need access to the entire repository so that patched code can be fully analyzed against the package code. Therefore, GofmtTool inherits from Tool, whereas GoTool inherits from RepositoryTool and is generally a much more computationally heavy tool.

Testing Done

Manual testing was done to confirm that go test is working correctly.
As of now it just creates general comments based on which tests have
failed.
Manual testing was done to confirm that go vet is working and known
issues about failures when used against test files have been resolved.

Files

Issues

Description	From	Last Updated
Go/Go Tools?	david	Oct. 29, 2020, 11:17 p.m.
gofmt still exists, but I believe the modern recommendation is to use go fmt. Can we do that here to …	david	Nov. 4, 2020, 11:16 p.m.
Perhaps "This file contains formatting errors and should be run through go fmt" ? Also, shouldn't this be checking the …	david	Nov. 4, 2020, 11:24 p.m.
Since we have the output already in a variable, it's kind of silly to write it out to a file …	david	Nov. 4, 2020, 11:26 p.m.
The imports should be in alphabetical order. So import json should be placed before import logging.	ceciliawei	Nov. 17, 2020, 9:51 p.m.
Using the string's "split" method isn't portable (for example, windows doesn't use "/" as a path splitter). That said, there's …	david	Nov. 17, 2020, 9:53 p.m.
It looks like we're running this for every single file, which means that if you have multiple changed files in …	david	Nov. 17, 2020, 9:56 p.m.
Let's pull the result of dest_file.lower() out into a variable so we don't have to call it twice in these …	chipx86	Dec. 3, 2020, 2:19 p.m.
Blank line between statements and the start of new blocks.	chipx86	Dec. 3, 2020, 2:20 p.m.
Same here.	chipx86	Dec. 3, 2020, 2:20 p.m.
This will scan the entirety of packages every iteration of files. If packages is a set, this will be faster, …	chipx86	Dec. 3, 2020, 2:34 p.m.
This can fail, so we'll want to check for exceptions, just as we do further down.	chipx86	Dec. 3, 2020, 2:45 p.m.
It's generally better to use %-formatted strings to join in variables, as this is faster in Python.	chipx86	Dec. 3, 2020, 2:38 p.m.
We prefer %-formatted strings, rather than .format(), as it's technically faster and more consistent with the rest of our codebase. …	chipx86	Dec. 3, 2020, 2:48 p.m.
When spanning lines, we prefer the % on the line with the variables, as it gives more room for the …	chipx86	Dec. 3, 2020, 2:50 p.m.
No blank line here.	chipx86	Dec. 3, 2020, 2:48 p.m.
Can you incorporate the package name in here, or something to help debug this if this comes up in production?	chipx86	Dec. 3, 2020, 3:19 p.m.
Same as above regarding %-formatted strings.	chipx86	Dec. 3, 2020, 2:51 p.m.
Let's use single quotes here, since the inner string doesn't use them. We prefer single quotes in Python strings wherever …	chipx86	Dec. 3, 2020, 2:52 p.m.
If we compile this regex before we do the outer loop, it'll speed this part up.	chipx86	Dec. 6, 2020, 11:40 a.m.
While common in Python, we shouldn't ever override _. That's because _ is commonly used as an alias to ugettext …	chipx86	Dec. 3, 2020, 7:47 p.m.
Rather than putting this into a variable and pulling the indexes out, let's just unpack with: filename, line_num = \ …	chipx86	Dec. 3, 2020, 3:13 p.m.
You can put the format argument on the same line, since it fits. No need for parens there either, since …	chipx86	Dec. 3, 2020, 3:04 p.m.
No blank line here.	chipx86	Dec. 3, 2020, 2:53 p.m.
Same comment as above regarding having identifying information in the message.	chipx86	Dec. 3, 2020, 2:58 p.m.
Things within here are getting pretty deeply nested. Maybe break out the test and vet implementations into their own methods?	david	Dec. 6, 2020, 3:19 p.m.
This should be wrapped in a try/except.	david	Dec. 6, 2020, 3:19 p.m.
Would it be possible to add a comment showing an example of what the JSON output from go vet looks …	david	Dec. 6, 2020, 3:19 p.m.
This is potentially running a lot of times, since it's deeply nested inside several loops. At the top level, can …	david	Dec. 6, 2020, 3:19 p.m.

flake8 passed.

JSHint passed.

Commit:

af6ed353931a4e3ddecaa739c08618f2e8d52532

4ad11e7957ebe6b29f6907d0eb8268814d2a4623

Diff:

Revision 2 (+69)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/gotool.py (Diff revision 2)
The issue has been resolved.
```
Go/Go Tools?
```
1. jblazusi Oct. 29, 2020, 11:18 p.m.
  My mistake, I was not exactly sure whether they were separate or not at the time. It is fixed now.

bot/reviewbot/tools/gotool.py (Diff revision 2)

It looks like you're leaning towards making the different uses of go optional, which is good. We should probably default to true for everything except go test, which should default to false (test suites can often be cumbersome and long-running).

jblazusi Oct. 29, 2020, 11:18 p.m.

That is a good idea, I will make sure to implement it that way.
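A minimal sketch of the defaults discussed above, assuming the tool declares its settings as a list of option dictionaries. The option names (`vet`, `test`), the labels, and the `default_settings` helper are hypothetical and are not taken from the actual change.

```python
# Hypothetical option declarations: every check defaults to on except
# `go test`, which can be slow on large test suites.
OPTIONS = [
    {
        'name': 'vet',
        'field_type': 'django.forms.BooleanField',
        'default': True,
        'field_options': {
            'label': 'Run go vet',
            'required': False,
        },
    },
    {
        'name': 'test',
        'field_type': 'django.forms.BooleanField',
        'default': False,
        'field_options': {
            'label': 'Run go test',
            'required': False,
        },
    },
]


def default_settings(options):
    """Return a mapping of option name to its default value."""
    return {option['name']: option['default'] for option in options}
```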

Description:

~		The tool is currently only capable of checking whether a file is
~		correctly formatted or not based on `go fmt`. Work still needs to be
~		done to implement `go fix`, `go test`, and `go vet`.
	~	The tool is currently capable of checking whether a file is
	~	correctly formatted or not based on `go fmt`. As well as doing some
	~	static analysis using `go vet`, more time is needed to complete the
	+	`go test` feature.

Testing Done:

~		Manual testing was done to confirm that `go fmt` is working correctly.
	~	Manual testing was done to confirm that `go fmt` is working correctly.
	+	Manual testing was done to confirm that `go vet` is working in almost
	+	all cases. There are known issues right now with test files and correct
	+	adjustments to the code will be made in the next week.

Commit:

4ad11e7957ebe6b29f6907d0eb8268814d2a4623

3baa171c9d1df9e5b9533aac18e075a4582930f9

Diff:

Revision 3 (+110)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Added Files:

bot/reviewbot/tools/gotool.py (Diff revision 3)

The issue has been resolved.

gofmt still exists, but I believe the modern recommendation is to use go fmt. Can we do that here to be more future-proof?

jblazusi Nov. 4, 2020, 11:26 p.m.

I believe that this is generally true; however, it is important to note that go fmt runs gofmt -l -w. The -w flag overwrites the file, although I do not think that this is an issue, especially since patched files are temporary. It is still worth noting in case whoever takes over the project runs into issues related to overwriting a file.

In short, I have updated the command to use go fmt and this can be found in the new GofmtTool CR.

bot/reviewbot/tools/gotool.py (Diff revision 3)

The issue has been resolved.

Perhaps "This file contains formatting errors and should be run through go fmt" ?
Also, shouldn't this be checking the output or return value from the format command? Seems like it just unconditionally adds the comment now (maybe that's still part of the WIP?)

jblazusi Nov. 4, 2020, 11:26 p.m.

I addressed the message in the new GofmtTool. I originally did not set up conditionals, since I wanted to make sure I was getting the correct output, so this is an artifact from early testing.

bot/reviewbot/tools/gotool.py (Diff revision 3)

The issue has been resolved.

Since we have the output already in a variable, it's kind of silly to write it out to a file and load it back in. We can just do:

try:
    json_data = json.loads(cleaned_output)
except Exception as e:
    ...

jblazusi Nov. 4, 2020, 11:26 p.m.

This is definitely a blunder. I was so caught up in using files from previous tools that I completely forgot about just passing in the output string.
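The suggestion above can be sketched as a small helper that parses the output string directly, with no temporary file. The function name `parse_tool_output` is hypothetical, and catching `ValueError` (rather than a bare `Exception`) is a choice made for this sketch.

```python
import json
import logging

logger = logging.getLogger(__name__)


def parse_tool_output(cleaned_output):
    """Parse JSON text held in a string, returning None if it is invalid."""
    try:
        return json.loads(cleaned_output)
    except ValueError as e:
        # On Python 3, json.JSONDecodeError is a subclass of ValueError.
        logger.error('Unable to parse tool output as JSON: %s', e)
        return None
```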

Summary:

[WIP] Go Tool for ReviewBot

Go Tool for ReviewBot

Description:

~		The tool is currently capable of checking whether a file is
~		correctly formatted or not based on `go fmt`. As well as doing some
~		static analysis using `go vet`, more time is needed to complete the
~		`go test` feature.
	~	This tool's ability to format was removed and placed into another tool,
	~	that tool is tentatively known as the GofmtTool. This was done because,
	~	`go fmt` is rather lightweight and does not rely on other files in order
	~	to carry out its function. However, `go test` and `go vet` require more
	+	information in order to work consistently. In particular they need
	+	access to the entire repository so that patched code can be fully
	+	analyzed against the package code. Therefore GofmtTool inherits from
	+	`Tool`, whereas GoTool inherits from `RepositoryTool` and is generally
	+	a much more computationally heavy tool.

Testing Done:

~		Manual testing was done to confirm that `go fmt` is working correctly.
~		Manual testing was done to confirm that `go vet` is working in almost
~		all cases. There are known issues right now with test files and correct
~		adjustments to the code will be made in the next week.
	~	Manual testing was done to confirm that `go test` is working correctly.
	~	As of now it just creates general comments based on which tests have
	~	failed.
	~	Manual testing was done to confirm that `go vet` is working and known
	+	issues about failures when used against test files have been resolved.

Commit:

3baa171c9d1df9e5b9533aac18e075a4582930f9

61427f66214d6fa076bdeb38f115e9662266f7dc

Diff:

Revision 4 (+138)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Added Files:

bot/reviewbot/tools/gotool.py (Diff revision 4)

The issue has been resolved.

The imports should be in alphabetical order. So import json should be placed before import logging.

jblazusi Nov. 17, 2020, 9:56 p.m.

Thank you, I will make sure I make this update in previous and future CRs.

bot/reviewbot/tools/gotool.py (Diff revision 4)

The issue has been resolved.

Using the string's "split" method isn't portable (for example, Windows doesn't use "/" as a path splitter). That said, there's both a portable and easier way to do this:

package = os.path.dirname(path)

jblazusi Nov. 17, 2020, 9:57 p.m.

I was not aware of this being a portability issue, thank you so much for the advice. I have updated my code to use the os.path module in other areas as well.
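As a brief illustration of the `os.path` approach, here is a sketch that treats the directory portion of a changed file's path as its package. The helper name `package_for` is hypothetical.

```python
import os


def package_for(path):
    """Return the directory portion of a file path, used here as the package."""
    return os.path.dirname(path)
```

Because `os.path` uses the separator rules of the platform it runs on, the same call works without hard-coding "/".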

bot/reviewbot/tools/gotool.py (Diff revision 4)

The issue has been resolved.

It looks like we're running this for every single file, which means that if you have multiple changed files in a given package, we'll add the test failures multiple times.

Instead of overriding handle_file, can we instead override handle_files, and use the file list to build a list of changed packages? We can then run the tests once per package.

jblazusi Nov. 17, 2020, 9:57 p.m.

Excellent, this is exactly what I had in mind when I was optimizing my FBInfer Tool. However, the advice of building a list of changed packages was a very useful starting point.
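One way to picture the once-per-package idea is the sketch below. It assumes plain path strings and a caller-supplied `run_tests` callable, both of which are stand-ins for illustration and not the tool's real interfaces.

```python
import os
from collections import defaultdict


def group_by_package(paths):
    """Map each package directory to the changed files it contains."""
    packages = defaultdict(list)

    for path in paths:
        packages[os.path.dirname(path)].append(path)

    return dict(packages)


def run_once_per_package(paths, run_tests):
    """Call run_tests(package) a single time for each changed package."""
    return {
        package: run_tests(package)
        for package in group_by_package(paths)
    }
```

With two changed files in the same package, `run_tests` is invoked once for that package, so failures are not reported twice.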

Commit:

61427f66214d6fa076bdeb38f115e9662266f7dc

8ab7b4cf5335e42e78ad7e233b5972eeccaa9755

Diff:

Revision 5 (+156)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

File Captions:

gotool_success.png:	go_vet_success.png go_tool.png

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

Let's pull the result of dest_file.lower() out into a variable so we don't have to call it twice in these checks.

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved.
```
Blank line between statements and the start of new blocks.
```
bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved.
```
Same here.
```

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

This will scan the entirety of packages every iteration of files. If packages is a set, this will be faster, though then we lose the ordering. Perhaps that's okay, and we can sort the results when iterating it?

If so, we can safely .add() into a set without needing to check existence first.

jblazusi Dec. 3, 2020, 4 p.m.

That's clever, I am not sure why I did not think of using a set before. Further down, we loop through every package so the order is not relevant.
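The set-based approach might look like the following sketch. The function name `changed_packages` and the use of plain path strings are assumptions made for illustration.

```python
import os


def changed_packages(paths):
    """Collect the unique package directories for a list of file paths."""
    packages = set()

    for path in paths:
        # No membership check is needed: adding an existing item is a no-op.
        packages.add(os.path.dirname(path))

    return packages
```

If a stable order matters when reporting, iterating with `sorted(changed_packages(paths))` restores it at the point of use.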

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

This can fail, so we'll want to check for exceptions, just as we do further down.

jblazusi Dec. 3, 2020, 4 p.m.

I totally agree with this. Although I think that it is worth mentioning that the other review bot tools do not have try/except blocks for the execute command.
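A hedged sketch of guarding a command invocation follows. It uses the standard library's `subprocess` as a stand-in for the project's own execute helper, and the function name `run_command` and the log wording are hypothetical. The package name is included in the log message so a failure can be traced later.

```python
import logging
import subprocess

logger = logging.getLogger(__name__)


def run_command(command, package):
    """Run a command and return its output, or None if it fails."""
    try:
        return subprocess.check_output(command, universal_newlines=True)
    except (OSError, subprocess.CalledProcessError) as e:
        # OSError covers a missing executable; CalledProcessError covers
        # a non-zero exit status.
        logger.exception('Command %r failed for package %s: %s',
                         command, package, e)
        return None
```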

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

It's generally better to use %-formatted strings to join in variables, as this is faster in Python.

jblazusi Dec. 3, 2020, 4 p.m.

I did not know that there was a difference in performance, that is good to know.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

We prefer %-formatted strings, rather than .format(), as it's technically faster and more consistent with the rest of our codebase.
This can also be combined with the previous line:

formatted_output = '[%s]' % ','.join(gotest_output)
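To show why this one-liner is useful, the sketch below assumes `gotest_output` is a list of strings that each hold one JSON object. Joining them with commas inside brackets produces a single JSON array that can be parsed in one call. The sample events and the helper name `combine_json_lines` are made up for illustration.

```python
import json


def combine_json_lines(lines):
    """Wrap a list of JSON object strings into one JSON array string."""
    return '[%s]' % ','.join(lines)


# Illustrative input only; these are not captured from a real test run.
gotest_output = [
    '{"Action": "run", "Test": "TestA"}',
    '{"Action": "fail", "Test": "TestA"}',
]

events = json.loads(combine_json_lines(gotest_output))
```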

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

When spanning lines, we prefer the % on the line with the variables, as it gives more room for the strings and helps more clearly indicate that we're formatting variables in.

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved.
```
No blank line here.
```

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

Can you incorporate the package name in here, or something to help debug this if this comes up in production?

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved.
```
Same as above regarding %-formatted strings.
```

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

Let's use single quotes here, since the inner string doesn't use them. We prefer single quotes in Python strings wherever possible.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

If we compile this regex before we do the outer loop, it'll speed this part up.

jblazusi Dec. 3, 2020, 4 p.m.

I tried my best to fix this. However, I am not that familiar with regex, so I would appreciate it if you took another look at my change.

david Dec. 4, 2020, 4:49 p.m.
```
Looks like you did it correctly.
```

jblazusi Dec. 6, 2020, 11:40 a.m.

Perfect, I will go ahead and mark this issue as fixed.
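For readers following along, compiling a regex ahead of the loops can be sketched as below. The pattern is illustrative only (it matches lines such as `main_test.go:12: message`) and is not the pattern used in the actual change. The names `LINE_RE` and `find_locations` are hypothetical.

```python
import re

# Compiled once at import time, outside of any loop.
LINE_RE = re.compile(r'^(?P<file>[^:\s]+\.go):(?P<line>\d+):')


def find_locations(outputs):
    """Return (file, line) pairs found in a list of output strings."""
    locations = []

    for output in outputs:
        for line in output.splitlines():
            match = LINE_RE.match(line.strip())

            if match:
                locations.append((match.group('file'),
                                  int(match.group('line'))))

    return locations
```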

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

While common in Python, we shouldn't ever override _. That's because _ is commonly used as an alias to ugettext or ugettext_lazy, and this can have unintentional side-effects.

Also for items, we need to use six.iteritems() to get consistent behavior between Python 2 and 3.
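A sketch of one way to avoid binding `_`: when the key is not needed, iterate over the values only. The function name and the input shape are hypothetical. This sketch sticks to the standard library; in a codebase that supports both Python 2 and 3 through six, `six.itervalues()` would be the corresponding spelling.

```python
def count_diagnostics(issues_by_analyzer):
    """Count diagnostics without binding the unused analyzer names."""
    total = 0

    # Iterating over values avoids unpacking a key that would otherwise
    # be assigned to `_` and shadow a gettext alias.
    for diagnostics in issues_by_analyzer.values():
        total += len(diagnostics)

    return total
```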

jblazusi Dec. 3, 2020, 4 p.m.

I had no idea, thank you for letting me know. Is this common across most Python programs, or just Review Board in particular?
If we are not using the variable, such as the situation I am in, what should I use instead of _?

jblazusi Dec. 3, 2020, 7:48 p.m.

Christian answered this question in Slack:
"naming is better than not, so that it's self-documenting. Avoiding unpacking those variables is even better. Fewer things for Python to deal with, less a maintainer has to worry about"

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

Rather than putting this into a variable and pulling the indexes out, let's just unpack with:

filename, line_num = \
    os.path.basename(key['posn']).split(':', 2)

Could you also put a comment above this showing the general format of what we should expect here, so it's documented?
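A sketch of the unpacking with the expected format documented in a comment. It assumes the position string has the form `path:line:column`; under that assumption, `split(':', 2)` yields three fields, so this sketch unpacks three names (unpacking into only two would raise `ValueError`). The helper name `parse_posn` is hypothetical.

```python
import os


def parse_posn(posn):
    """Split a "path:line:column" position into (filename, line, column)."""
    # Assumed format, for example: "/tmp/pkg/main.go:12:5"
    filename, line_num, column = os.path.basename(posn).split(':', 2)

    return filename, int(line_num), int(column)
```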

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved.

You can put the format argument on the same line, since it fits. No need for parens there either, since we're not building a tuple (only one arg, and also (message) is equivalent to message since there's no tuple indicator, like a trailing comma or a second value).
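The point about parentheses can be checked with a short example. The message text and the `package` value are made up for illustration.

```python
package = 'example/pkg'

# Parentheses alone do not create a tuple; a trailing comma does.
message_plain = 'go vet failed for %s' % package
message_parens = 'go vet failed for %s' % (package)
message_tuple = 'go vet failed for %s' % (package,)
```

All three produce the same string, so the bare form is the simplest when there is a single argument.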

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved.
```
No blank line here.
```
bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved.
```
Same comment as above regarding having identifying information in the message.
```

Change Summary:

Addressed most of Christian's comments, though I would like some clarification on 2 of them.

Commit:

8ab7b4cf5335e42e78ad7e233b5972eeccaa9755

6641780e134a70fc7ea0765d6c8661b69905485a

People:

jace

Diff:

Revision 6 (+162)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Change Summary:

Addressed more comments by Christian; only 1 comment now requires a check.

Summary:

Go Tool for ReviewBot

Go Tool for Review Bot

Commit:

6641780e134a70fc7ea0765d6c8661b69905485a

7b4ed64bc45b060b596a5b9bdaee4fb9707924a7

Diff:

Revision 7 (+162)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved.

Things within here are getting pretty deeply nested. Maybe break out the test and vet implementations into their own methods?

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved.

This should be wrapped in a try/except.

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved.

Would it be possible to add a comment showing an example of what the JSON output from go vet looks like? That way people looking at the code can see how it maps to what it's parsing.
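As an illustration of the kind of comment being requested, the sketch below assumes the JSON maps a package to analyzer names, and each analyzer to a list of diagnostics carrying `posn` and `message` fields. The package path, analyzer name, and message are invented sample values, and `iter_diagnostics` is a hypothetical helper, not the code from this change.

```python
import json

# Illustrative sample only. Assumed shape:
#   package -> analyzer -> list of {"posn": ..., "message": ...}
SAMPLE_VET_OUTPUT = '''
{
    "example.com/pkg": {
        "printf": [
            {
                "posn": "/tmp/pkg/main.go:12:5",
                "message": "illustrative diagnostic message"
            }
        ]
    }
}
'''


def iter_diagnostics(vet_json):
    """Yield (package, analyzer, posn, message) for each diagnostic."""
    data = json.loads(vet_json)

    for package in sorted(data):
        analyzers = data[package]

        for analyzer in sorted(analyzers):
            for diagnostic in analyzers[analyzer]:
                yield (package, analyzer,
                       diagnostic['posn'], diagnostic['message'])
```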

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved.

This is potentially running a lot of times, since it's deeply nested inside several loops.

At the top level, can we create a new dict that maps the patched file path back to f? That way we can just index into it instead of looping every time.
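The lookup-table idea can be sketched as follows. `FakeFile` and its `patched_path` attribute are hypothetical stand-ins for the file objects the tool receives, used here only so the example runs on its own.

```python
class FakeFile(object):
    """Hypothetical stand-in for the file objects passed to the tool."""

    def __init__(self, patched_path):
        self.patched_path = patched_path


def build_path_index(files):
    """Map each patched file path to its file object."""
    return {f.patched_path: f for f in files}
```

Building the dictionary once at the top level turns each later lookup into a single dictionary access instead of a loop over every file.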

Change Summary:

Addressed David's comments and updated code to include sample JSON in the comments, as well as a dictionary to decrease execution time.

Commit:

7b4ed64bc45b060b596a5b9bdaee4fb9707924a7

2c4183cb42e072c7dd0948f330ce267a877fe84b

Diff:

Revision 8 (+192)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Change Summary:

Fixed logger.exception formatting.

Commit:

2c4183cb42e072c7dd0948f330ce267a877fe84b

61c94b09e77f853a401fdffd49c5726905b248a8

Diff:

Revision 9 (+192)

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Ship it!

```
Ship It!
```

Status: Completed
Change Summary: Pushed to release-2.0.x (acb0cf2)