This change has been marked as completed.

Describe the completed change (optional):

Pushed to release-2.0.x (acb0cf2)

Summary

Go Tool for Review Bot

Review Request #11240 — Created Oct. 22, 2020 and submitted 4 years, 6 months ago

Information

Owner

jblazusi*

Repository

ReviewBot

Branch

master

Bugs

Depends On

Commit

61c94b0...

Reviewers

Groups

reviewbot, students

People

ceciliawei

Description*

This tool's ability to format was removed and placed into another tool,

that tool is tentatively known as the GofmtTool. This was done because,

go fmt is rather lightweight and does not rely on other files in order

to carry out its function. However, go test and go vet require more

information in order to work consistently. In particular they need

access to the entire repository so that patched code can be fully

analyzed against the package code. Therefore GofmtTool inherits from

Tool, whereas GoTool inherits from RepositoryTool and is generally

a much more computationally heavy tool.

Testing Done

Manual testing was done to confirm that go test is working correctly.
As of now it just creates general comments based on which tests have
failed.
Manual testing was done to confirm that go vet is working and known
issues about failures when used against test files have been resolved.

Files

go_tool.png

Issues

Description	From	Last Updated
Go/Go Tools?	david	4 years, 8 months ago
gofmt still exists, but I believe the modern recommendation is to use go fmt. Can we do that here to …	david	4 years, 8 months ago
Perhaps "This file contains formatting errors and should be run through go fmt" ? Also, shouldn't this be checking the …	david	4 years, 8 months ago
Since we have the output already in a variable, it's kind of silly to write it out to a file …	david	4 years, 8 months ago
The imports should be in alphabetical order. So import json should be placed before import logging.	ceciliawei	4 years, 7 months ago
Using the string's "split" method isn't portable (for example, windows doesn't use "/" as a path splitter). That said, there's …	david	4 years, 7 months ago
It looks like we're running this for every single file, which means that if you have multiple changed files in …	david	4 years, 7 months ago
Let's pull the result of dest_file.lower() out into a variable so we don't have to call it twice in these …	chipx86	4 years, 7 months ago
Blank line between statements and the start of new blocks.	chipx86	4 years, 7 months ago
Same here.	chipx86	4 years, 7 months ago
This will scan the entirety of packages every iteration of files. If packages is a set, this will be faster, …	chipx86	4 years, 7 months ago
This can fail, so we'll want to check for exceptions, just as we do further down.	chipx86	4 years, 7 months ago
It's generally better to use %-formatted strings to join in variables, as this is faster in Python.	chipx86	4 years, 7 months ago
We prefer %-formatted strings, rather than .format(), as it's technically faster and more consistent with the rest of our codebase. …	chipx86	4 years, 7 months ago
When spanning lines, we prefer the % on the line with the variables, as it gives more room for the …	chipx86	4 years, 7 months ago
No blank line here.	chipx86	4 years, 7 months ago
Can you incorporate the package name in here, or something to help debug this if this comes up in production?	chipx86	4 years, 7 months ago
Same as above regarding %-formatted strings.	chipx86	4 years, 7 months ago
Let's use single quotes here, since the inner string doesn't use them. We prefer single quotes in Python strings wherever …	chipx86	4 years, 7 months ago
If we compile this regex before we do the outer loop, it'll speed this part up.	chipx86	4 years, 7 months ago
While common in Python, we shouldn't ever override _. That's because _ is commonly used as an alias to ugettext …	chipx86	4 years, 7 months ago
Rather than putting this into a variable and pulling the indexes out, let's just unpack with: filename, line_num = \ …	chipx86	4 years, 7 months ago
You can put the format argument on the same line, since it fits. No need for parens there either, since …	chipx86	4 years, 7 months ago
No blank line here.	chipx86	4 years, 7 months ago
Same comment as above regarding having identifying information in the message.	chipx86	4 years, 7 months ago
Things within here are getting pretty deeply nested. Maybe break out the test and vet implementations into their own methods?	david	4 years, 7 months ago
This should be wrapped in a try/except.	david	4 years, 7 months ago
Would it be possible to add a comment showing an example of what the JSON output from go vet looks …	david	4 years, 7 months ago
This is potentially running a lot of times, since it's deeply nested inside several loops. At the top level, can …	david	4 years, 7 months ago
There are no open issues

flake8 passed.

JSHint passed.

Commit:

af6ed353931a4e3ddecaa739c08618f2e8d52532

4ad11e7957ebe6b29f6907d0eb8268814d2a4623

Diff:

Revision 2 (+69)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/gotool.py (Diff revision 2)

The issue has been resolved. Show all issues

Go/Go Tools?

jblazusi 4 years, 8 months ago

My mistake, I was not exactly sure whether they were separate or not at the time. It is fixed now.

bot/reviewbot/tools/gotool.py (Diff revision 2)

It looks like you're leaning towards making the different uses of go optional, which is good. We should probably default to true for everything except go test, which should default to false (test suites can often be cumbersome and long-running).

jblazusi 4 years, 8 months ago

That is a good idea, I will make sure to implement it that way.

Description:

~		The tool is currently only capable of checking whether a file is
~		correctly formatted or not based on `go fmt`. Work still needs to be
~		done to implement `go fix`, `go test`, and `go vet`.
	~	The tool is currently capable of checking whether a file is
	~	correctly formatted or not based on `go fmt`. As well as doing some
	~	static analysis using `go vet`, more time is needed to complete the
	+	`go test` feature.

Testing Done:

~		Manual testing was done to confirm that `go fmt` is working correctly.
	~	Manual testing was done to confirm that `go fmt` is working correctly.
	+	Manual testing was done to confirm that `go vet` is working in almost
	+	all cases. There are known issues right now with test files and correct
	+	adjustments to the code will be made in the next week.

Commit:

4ad11e7957ebe6b29f6907d0eb8268814d2a4623

3baa171c9d1df9e5b9533aac18e075a4582930f9

Diff:

Revision 3 (+110)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Added Files:: Review

Download

go_vet_success.png

bot/reviewbot/tools/gotool.py (Diff revision 3)

The issue has been resolved. Show all issues

gofmt still exists, but I believe the modern recommendation is to use go fmt. Can we do that here to be more future-proof?

jblazusi 4 years, 8 months ago

I believe that this is generally true, however, it is important to note that go fmt runs gofmt -l -w. The -w flag is responsible for overwriting the file, although I do not think that this is an issue, especially since patched files are temporary. But I do think that it is worth noting in case, whomever takes over the project is experiencing issues relating to overwriting a file.

In short, I have updated the command to use go fmt and this can be found in the new GofmtTool CR.

bot/reviewbot/tools/gotool.py (Diff revision 3)

The issue has been resolved. Show all issues

Perhaps "This file contains formatting errors and should be run through go fmt" ?
Also, shouldn't this be checking the output or return value from the format command? Seems like it just unconditionally adds the comment now (maybe that's still part of the WIP?)

jblazusi 4 years, 8 months ago

I addressed the message in the new GofmtTool. I originally did not setup conditionals, since I wanted to make sure I was getting the correct output, so this is an artifact from early testing.

bot/reviewbot/tools/gotool.py (Diff revision 3)

The issue has been resolved. Show all issues

Since we have the output already in a variable, it's kind of silly to write it out to a file and load it back in. We can just do:

try:
    json_data = json.loads(cleaned_output)
except Exception as e:
    ...

jblazusi 4 years, 8 months ago

This is definitely a blunder. I was a so caught up in using files from previous tools, that I completely forgot about just passing in the output string.

Summary:

[WIP] Go Tool for ReviewBot

Go Tool for ReviewBot

Description:

~		The tool is currently capable of checking whether a file is
~		correctly formatted or not based on `go fmt`. As well as doing some
~		static analysis using `go vet`, more time is needed to complete the
~		`go test` feature.
	~	This tool's ability to format was removed and placed into another tool,
	~	that tool is tentatively known as the GofmtTool. This was done because,
	~	`go fmt` is rather lightweight and does not rely on other files in order
	~	to carry out its function. However, `go test` and `go vet` require more
	+	information in order to work consistently. In particular they need
	+	access to the entire repository so that patched code can be fully
	+	analyzed against the package code. Therefore GofmtTool inherits from
	+	`Tool`, whereas GoTool inherits from `RepositoryTool` and is generally
	+	a much more computationally heavy tool.

Testing Done:

~		Manual testing was done to confirm that `go fmt` is working correctly.
~		Manual testing was done to confirm that `go vet` is working in almost
~		all cases. There are known issues right now with test files and correct
~		adjustments to the code will be made in the next week.
	~	Manual testing was done to confirm that `go test` is working correctly.
	~	As of now it just creates general comments based on which tests have
	~	failed.
	~	Manual testing was done to confirm that `go vet` is working and known
	+	issues about failures when used against test files have been resolved.

Commit:

3baa171c9d1df9e5b9533aac18e075a4582930f9

61427f66214d6fa076bdeb38f115e9662266f7dc

Diff:

Revision 4 (+138)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Added Files:: Review

Download

go_tool.png

bot/reviewbot/tools/gotool.py (Diff revision 4)

The issue has been resolved. Show all issues

The imports should be in alphabetical order. So import json should be placed before import logging.

jblazusi 4 years, 7 months ago

Thank you, I will make sure I make this update in previous and future CRs.

bot/reviewbot/tools/gotool.py (Diff revision 4)

The issue has been resolved. Show all issues

Using the string's "split" method isn't portable (for example, windows doesn't use "/" as a path splitter). That said, there's both a portable and easier way to do this:

package = os.path.dirname(path)

jblazusi 4 years, 7 months ago

I was not aware of this being a portability issue, thank you so much for the advice. I have updated my code to use the os.path module in other areas as well.

bot/reviewbot/tools/gotool.py (Diff revision 4)

The issue has been resolved. Show all issues

It looks like we're running this for every single file, which means that if you have multiple changed files in a given package, we'll add the test failures multiple times.

Instead of overriding handle_file, can we instead override handle_files, and use the file list to build a list of changed packages? We can then run the tests once per package.

jblazusi 4 years, 7 months ago

Excellent, this is exactly what I had in mind when I was optimizing my FBInfer Tool. However, the advice of building a list of changed packages was a very useful starting point.

Commit:

61427f66214d6fa076bdeb38f115e9662266f7dc

8ab7b4cf5335e42e78ad7e233b5972eeccaa9755

Diff:

Revision 5 (+156)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

File Captions:

gotool_success.png:	go_vet_success.png go_tool.png

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

Let's pull the result of dest_file.lower() out into a variable so we don't have to call it twice in these checks.

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Blank line between statements and the start of new blocks.
```
bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Same here.
```

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

This will scan the entirety of packages every iteration of files. If packages is a set, this will be faster, though then we lose the ordering. Perhaps that's okay, and we can sort the results when iterating it?

If so, we can safely .add() into a set without needing to check existence first.

jblazusi 4 years, 7 months ago

That's clever, I am not sure why I did not think of using a set before. Further down, we loop through every package so the order is not relevant.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

This can fail, so we'll want to check for exceptions, just as we do further down.

jblazusi 4 years, 7 months ago

I totally agree with this. Although I think that it is worth mentioning that the other review bot tools do not have try/except blocks for the execute command.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

It's generally better to use %-formatted strings to join in variables, as this is faster in Python.

jblazusi 4 years, 7 months ago

I did not know that there was a difference in performance, that is good to know.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

We prefer %-formatted strings, rather than .format(), as it's technically faster and more consistent with the rest of our codebase.
This can also be combiend with the previous line:

formatted_output = '[%s]' % ','.join(gotest_output)

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

When spanning lines, we prefer the % on the line with the variables, as it gives more room for the strings and helps more clearly indicate that we're formatting variables in.

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved. Show all issues
```
No blank line here.
```

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

Can you incorporate the package name in here, or something to help debug this if this comes up in production?

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Same as above regarding %-formatted strings.
```

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

Let's use single quotes here, since the inner string doesn't use them. We prefer single quotes in Python strings wherever possible.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

If we compile this regex before we do the outer loop, it'll speed this part up.

jblazusi 4 years, 7 months ago

I tried my best to fix this. However, I am not that familiar with regex, so I would appreciate it if you took another look at my change.

david 4 years, 7 months ago
```
Looks like you did it correctly.
```

jblazusi 4 years, 7 months ago

Perfect, I will go ahead and mark this issue as fixed.

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

While common in Python, we shouldn't ever override _. That's because _ is commonly used as an alias to ugettext or ugettext_lazy, and this can have unintentional side-effects.

Also for items, we need to use six.iteritems() to get consistent behavior between Python 2 and 3.

jblazusi 4 years, 7 months ago

I had no idea, thank you for letting me know. Is this common across most python programs, or just reviewboard in particular?
If we are not using the variable, such as the situation I am in, what should I use instead of _?

jblazusi 4 years, 7 months ago

Christian answered this question in the slack:
"naming is better than not, so that it's self-documenting. Avoiding unpacking those variables is even better. Fewer things for Python to deal with, less a maintainer has to worry about"

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

Rather than putting this into a variable and pulling the indexes out, let's just unpack with:

filename, line_num = \
    os.path.basename(key['posn']).split(':', 2)

Could you also put a comment above this showing the general format of what we should expect here, so it's documented?

bot/reviewbot/tools/gotool.py (Diff revision 5)

The issue has been resolved. Show all issues

You can put the format argument on the same line, since it fits. No need for parens there either, since we're not building a tuple (only one arg, and also (message) is equivalent to message since there's no tuple indicator, like a trailing comma or a second value).

bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved. Show all issues
```
No blank line here.
```
bot/reviewbot/tools/gotool.py (Diff revision 5)
The issue has been resolved. Show all issues
```
Same comment as above regarding having identifying information in the message.
```

Change Summary:

Addressed most of Christian's comments, though I would like some clarification on 2 of them.

Commit:

8ab7b4cf5335e42e78ad7e233b5972eeccaa9755

6641780e134a70fc7ea0765d6c8661b69905485a

People:

jace

Diff:

Revision 6 (+162)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Change Summary:

Addressed more comments by Christian, now only 1 comments requires a check.

Summary:

Go Tool for ReviewBot

Go Tool for Review Bot

Commit:

6641780e134a70fc7ea0765d6c8661b69905485a

7b4ed64bc45b060b596a5b9bdaee4fb9707924a7

Diff:

Revision 7 (+162)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved. Show all issues

Things within here are getting pretty deeply nested. Maybe break out the test and vet implementations into their own methods?

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved. Show all issues

This should be wrapped in a try/except.

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved. Show all issues

Would it be possible to add a comment showing an example of what the JSON output from go vet looks like? That way people looking at the code can see how it maps to what it's parsing.

bot/reviewbot/tools/gotool.py (Diff revision 7)

The issue has been resolved. Show all issues

This is potentially running a lot of times, since it's deeply nested inside several loops.

At the top level, can we create a new dict that maps the patched file path back to f? That way we can just index into it instead of looping every time.

Change Summary:

Addressed David's comments and updated code to include sample JSON in the comments, as well as a dictionary to decrease execution time.

Commit:

7b4ed64bc45b060b596a5b9bdaee4fb9707924a7

2c4183cb42e072c7dd0948f330ce267a877fe84b

Diff:

Revision 8 (+192)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Change Summary:

Fixed logger.exception formatting.

Commit:

2c4183cb42e072c7dd0948f330ce267a877fe84b

61c94b09e77f853a401fdffd49c5726905b248a8

Diff:

Revision 9 (+192)

Show changes

	bot/setup.py
	bot/reviewbot/tools/gotool.py

Checks run (2 succeeded)

flake8 passed.

JSHint passed.

Ship it!

```
Ship It!
```

Status:: Completed
Change Summary:: Pushed to release-2.0.x (acb0cf2)