
Implementation status dashboard #6075

Open · wants to merge 23 commits into base: master

Conversation

@chriscool (Contributor) commented on Mar 13, 2019

The goal of this PR is to make it possible to have an implementation status dashboard for any IPFS implementation, as described in: ipfs-inactive/ipfs-sharness-tests#6

An example of a resulting dashboard produced by the code in this PR is:

https://gist.github.com/chriscool/1c07b5944a6fc058cf677a04fec62eec

This is done by:

  • using an existing coverage script that can output a summary of which ipfs commands, with which options, are used in which Sharness test scripts,
  • running the Sharness test scripts with prove, as prove can write .prove files that tell which tests passed or failed,
  • combining the coverage output with the .prove output to generate a status*.txt file,
  • combining many status*.txt files into a dashboard in Markdown format.

The main way to perform the above steps concretely is to use the Makefile provided by this PR; a rough sketch of the pipeline is shown below.
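For illustration, here is a minimal sketch of the pipeline under these assumptions: the helper script names (coverage_helper.sh, gen_dashboard.sh) are hypothetical placeholders for the scripts in this PR, and only prove's --state flag is a standard prove feature:

```sh
cd test/sharness

# 1. Run the test scripts under prove; --state=save makes prove write
#    a .prove file recording which tests passed or failed.
prove --state=save ./t*.sh

# 2. Summarize which ipfs commands and options each script exercises,
#    and merge that with the .prove results into a status file
#    (script name is hypothetical).
./coverage_helper.sh > status_go-ipfs.txt

# 3. Combine any number of status*.txt files into a Markdown dashboard
#    (script name is hypothetical).
./gen_dashboard.sh status_*.txt > dashboard.md
```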

cc @scout @daviddias

@chriscool chriscool requested a review from Kubuxu as a code owner March 13, 2019 14:04
@ghost ghost assigned chriscool Mar 13, 2019
@ghost ghost added the status/in-progress In progress label Mar 13, 2019
@chriscool (Contributor, Author) commented on Mar 18, 2019

@Kubuxu and @daviddias are you ok to review this? Or do you think someone else should review?

Note that I just added a commit to improve the dashboard output (https://gist.github.com/chriscool/1c07b5944a6fc058cf677a04fec62eec) by using the following legend items in the output:

🍏 Passed   🍋 Didn't run   🍅 Failed   🌰 No test implemented
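As a purely hypothetical illustration (the actual layout is in the gist linked above), a dashboard row using these legend items could look like:

```
| Command     | go-ipfs | js-ipfs   |
| ----------- | ------- | --------- |
| ipfs add -q | 🍏 10   | 🍅 2 🍋 8 |
| ipfs dht    | 🍏 4    | 🌰        |
```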

@Kubuxu Kubuxu requested a review from magik6k March 18, 2019 17:30
@Stebalien (Member) left a comment


So, the output looks nice but there's a lot of cryptic, uncommented, text-munching shell and perl scripts in this PR.

  • Could we consider something like JUnit instead of prove? We already patch sharness to support JUnit.
  • Perl is, for better or worse, notoriously hard to read and a dying language (for that reason). This is going to be hard to maintain. If we're going to keep the Perl, we're going to need to document it thoroughly.
  • Instead of parsing the text to figure out which commands we've run, we should probably just inject a fake IPFS binary that logs commands to a file before calling the actual ipfs command. That's going to be a lot simpler, more reliable, and easier to maintain.
  • What's our end-goal here? Do we just want a list of supported commands? Sharness isn't going to be able to really give us any reliable information on "in progress" and we should be able to get everything else by asking js-ipfs/go-ipfs. I guess this last question is for @daviddias.

(Inline review comments, now outdated and resolved, on test/sharness/status/results/status_new.md and test/sharness/status/README.md.)
@chriscool (Contributor, Author) commented:

First some background (that I should have added in the description of this PR):

The scripts in this PR are part of the Sharness tests. In fact they could perhaps even be moved to Sharness itself, as they are not specific to IPFS, and I think it's better to keep them IPFS-agnostic as much as possible.

In fact it is already planned to have all the Sharness tests in a separate repo (https://github.com/ipfs/ipfs-sharness-tests/) and to have this repo be a submodule in the go-ipfs and js-ipfs repos.

So, the output looks nice but there's a lot of cryptic, uncommented, text-munching shell and perl scripts in this PR.

Yeah, I will try to comment the Perl code more. I tried to reduce the amount of Perl code, but for some things like hash maps and transforms using regexps it's much simpler than doing it in shell (especially if we want it to run with many different shells not just bash).

I could also have used sed instead of Perl for regexps but this would not likely have made things more readable.

  • Could we consider something like JUnit instead of prove? We already patch sharness to support JUnit.

I don't know the advantage of JUnit over prove and where this patching is done and for which purpose. Anyway prove is part of the Perl ecosystem and Perl is part of the Git/Sharness ecosystem. So using prove and Perl is more likely to run out of the box everywhere Git and Sharness are installed.

  • Perl is, for better or worse, notoriously hard to read and a dying language (for that reason). This is going to be hard to maintain. If we're going to keep the perl, we're going to need to document it thoroughly.

OK, I will improve the documentation and comments in some subsequent commits.

Perl is still used a lot for admin scripts, especially when something portable is required. Also, the scripts in this PR are likely to be maintained by the infrastructure team, which is more likely to be familiar with Perl than with JUnit.

  • Instead of parsing the text to figure out which commands we've run, we should probably just inject a fake IPFS binary that logs commands to a file before calling the actual ipfs command. That's going to be a lot simpler, more reliable, and easier to maintain.

If the IPFS Sharness tests are used not just to test go-ipfs but also to test other implementations that are just starting to be developed, then calling the actual ipfs command under test is very likely to fail early in many tests, which means that the following instructions in those tests won't run (most tests consist of ipfs commands chained using &&). The log would therefore miss many calls to ipfs commands, and the report would be very incomplete.
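To illustrate the && chaining (a generic example in the Sharness idiom, not a specific test from this suite):

```sh
test_expect_success "add and cat a file" '
	echo "hello" >expected &&
	hash=$(ipfs add -q expected) &&  # if add fails on a partial implementation...
	ipfs cat "$hash" >actual &&      # ...cat is never invoked, so it is never logged
	test_cmp expected actual
'
```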

If we decide to inject a fake ipfs binary that logs commands and then always calls the go-ipfs binary instead of the actual ipfs command under test, this could work better, but it would depend on always having and carrying around a good, as up-to-date as possible, go-ipfs binary for that purpose, which is a significant maintenance issue.
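A minimal sketch of such a logging wrapper, assuming a hypothetical log-file variable and a hypothetical path for the real binary:

```sh
#!/bin/sh
# Hypothetical "fake ipfs" wrapper: record every invocation, then
# delegate to the real binary. IPFS_CMD_LOG and the fallback paths
# are illustrative, not names used by this PR.
echo "ipfs $*" >> "${IPFS_CMD_LOG:-/tmp/ipfs-commands.log}"
exec "${REAL_IPFS:-/usr/local/bin/ipfs-real}" "$@"
```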

  • What's our end-goal here? Do we just want a list of supported commands? Sharness isn't going to be able to really give us any reliable information on "in progress" and we should be able to get everything else by asking js-ipfs/go-ipfs. I guess this last question is for @daviddias.

I am not sure what "in progress" and "asking js-ipfs/go-ipfs" mean here.

Anyway, I thought that we wanted something like https://github.com/ipfs/ipfs/blob/master/IMPLEMENTATION_STATUS.md, and I think that this PR gives something like that automatically, with a bit more detail.

@Stebalien (Member) commented:

I don't know the advantage of JUnit over prove and where this patching is done and for which purpose. Anyway prove is part of the Perl ecosystem and Perl is part of the Git/Sharness ecosystem. So using prove and Perl is more likely to run out of the box everywhere Git and Sharness are installed.

We patch it here:

https://github.com/ipfs/go-ipfs/blob/34b85d3b08492ba7d6e31489ad4bbff58482fc6d/test/sharness/lib/install-sharness.sh#L32

We can use prove, but having to include a custom prove file parser is icky. The advantages of JUnit are:

  1. We're already using it.
  2. We can probably use a standard tool for parsing the output instead of including a bunch of bash scripts with regexes.
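For reference, JUnit XML has a widely supported shape that off-the-shelf tools can parse; this is an illustrative sample, not actual output from this test suite:

```xml
<testsuite name="t0010-basic-commands" tests="3" failures="1">
  <testcase name="ipfs version succeeds" time="0.12"/>
  <testcase name="ipfs add works" time="0.30">
    <failure message="command failed"/>
  </testcase>
  <testcase name="ipfs cat works" time="0.08"/>
</testsuite>
```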

The scripts in this PR are part of the Sharness tests. In fact they could perhaps even be moved to Sharness itself as they are not specific to IPFS. And I think it's better to keep them this way as much as possible.

Could we try upstreaming them? That would make me feel much better about this.

If the IPFS Sharness tests are used not just to test go-ipfs but also to test other implementations that are just starting to be developed, then calling the actual ipfs command under test is very likely to fail early in many tests, which means that the following instructions in those tests won't run (most tests consist of ipfs commands chained using &&). The log would therefore miss many calls to ipfs commands, and the report would be very incomplete.

I see. So you want to generate a list of ipfs commands we could test, to compare it with a list of commands that actually work. We could also use ipfs commands --flags but that requires a go-ipfs binary (and won't show any commands that, e.g., js-ipfs has but go-ipfs doesn't).

I am not sure what "in progress" and "asking js-ipfs/go-ipfs" mean here.

  • "in progress": I'm refering to the "In Progress" status of the IPFS implementation status page.
  • "asking js-ipfs/go-ipfs": I mean a command like ipfs commands --flags but for js-ipfs.

Unless I'm mistaken, the point here is to generate a list of things that:

  1. Aren't implemented.
  2. Are partially implemented.
  3. Are fully implemented.

We can get 1 versus 3 by comparing lists of supported commands. I'm guessing these scripts currently try to get (2) by running the sharness tests and looking for commands that claim to be supported but aren't.

@chriscool (Contributor, Author) commented:

About JUnit, I would be OK with adding a JUnit output, but I don't think the way it is patched in right now is very nice. It should probably be upstreamed into Sharness too.

Also the test output from Sharness is in the TAP (Test Anything Protocol https://testanything.org/) format, so a TAP to JUnit converter could be a completely separate project that we could just use instead of patching Sharness.
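For reference, TAP output is plain text along these lines (an illustrative sample):

```
TAP version 13
ok 1 - init succeeds
not ok 2 - add works
ok 3 - cat works # SKIP dependency failed
1..3
```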

About prove, it has some interesting options, like -j 4 to run tests in parallel. I am not sure it is very usable right now for go-ipfs, but when I run many Git tests I always use that option, as it speeds things up significantly. I think we will want something like that at some point. The output from prove is a very simple text file that is very easy to parse or to convert to something else.
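For illustration, a parallel prove run and its summary look roughly like this (the test file names are illustrative and the summary is abridged):

```
$ prove -j 4 t0010-basic-commands.sh t0020-init.sh
t0010-basic-commands.sh .. ok
t0020-init.sh ............ ok
All tests successful.
Files=2, Tests=25, 12 wallclock secs
Result: PASS
```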

Could we try upstreaming them? That would make me feel much better about this.

Yeah, I think it would be a good idea to upstream the scripts in this PR.

Unless I'm mistaken, the point here is to generate a list of things that:

  1. Aren't implemented.
  2. Are partially implemented.
  3. Are fully implemented.

We can get 1 versus 3 by comparing lists of supported commands. I'm guessing these scripts currently try to get (2) by running the sharness tests and looking for commands that claim to be supported but aren't.

With the result from the scripts in this PR we can easily see the following:

  • which ipfs commands and options are not covered by Sharness tests
  • the number of tests passing/failing/not run for each ipfs command and option

And I think this is enough to get a good idea about the overall progress of an ipfs implementation and the Sharness test infrastructure.

Deciding between "not" or "partially" or "fully" implemented cannot really be the goal right now because for example:

  • if an ipfs command "foo" is not implemented at all, then ipfs commands --flags will not report it and it is likely to have no test, so we cannot even know it exists
  • if it is reported by ipfs commands --flags but it does nothing or nearly nothing, and has no test, how is it different from a command which is fully implemented but also has no test?

If all the tests were implemented and could easily be run for all the commands we want, and if all those commands were reported by ipfs commands --flags whether they are implemented or not, then the current scripts could give us a "not", "partially", or "fully" implemented status for each command: the number of tests passing/failing for each ipfs command and option would give us a ratio of how much of each command and option is working.
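A hypothetical sketch of that ratio computation, assuming a status file with one "command passed failed" triple per line (the real status*.txt format in this PR may differ):

```sh
# Print a pass percentage per command from hypothetical input lines
# such as "add 7 3" (command, tests passed, tests failed).
awk '{ total = $2 + $3
       printf "%-12s %3.0f%%\n", $1, (total ? 100 * $2 / total : 0) }' status_go-ipfs.txt
```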

@magik6k (Member) commented on Apr 1, 2019

Also the test output from Sharness is in the TAP (Test Anything Protocol https://testanything.org/) format, so a TAP to JUnit converter could be a completely separate project that we could just use instead of patching Sharness.

There are reasons I created this (admittedly hacky) patch vs just using a TAP converter:

  • TAP is missing timing information, which is very useful when looking for slow tests
  • output with TEST_VERBOSE=1 couldn't be parsed by any converter I tried
  • Test stderr is not really possible to parse

(the last 2 points may be bugs or wrong implementation in Sharness)

@chriscool (Contributor, Author) commented:

About timing information, the Git test framework (from which Sharness was initially created) has an extension with a lot of functionality around performance tests. It can even send performance test results to a Codespeed server (https://github.com/tobami/codespeed). If we really need performance tests, it could be worth porting that extension to Sharness.

I agree that it should be possible to fix the last 2 points and there is now --verbose-log that might help.

@chriscool (Contributor, Author) commented:

See https://github.com/git/git/tree/master/t/perf for the performance test extension in Git.

@Stebalien (Member) commented:

With the result from the scripts in this PR we can easily see the following:

  • which ipfs commands and options are not covered by Sharness tests
  • the number of tests passing/failing/not run for each ipfs command and option

And I think this is enough to get a good idea about the overall progress of an ipfs implementation and the Sharness test infrastructure.

Ok, but that's definitely not what ipfs-inactive/ipfs-sharness-tests#6 is asking for which is why I was so confused.

But yeah, I see how this is an implementation status. However, I believe we'll need to be able to set expected failures based on the ipfs implementation we're testing, instead of the current test_expect_success/test_expect_failure.

Deciding between "not" or "partially" or "fully" implemented cannot really be the goal right now because for example:

We could combine a list of commands supported by all implementations and then just assume that if the flag exists, it's supported. It's not perfect but, as you say, tests aren't going to give us any better information.


As I see it, the TODOs are:

  1. Add a bunch of comments to the new scripts explaining what they're doing and how. Especially in implementation_status.sh.
  2. Upstream as much of the code as possible.

@chriscool (Contributor, Author) commented:

Ok, I will work on the TODOs you listed. Thanks!

@Stebalien Stebalien added the need/author-input Needs input from the original author label Apr 26, 2019
@momack2 momack2 added this to In Progress in ipfs/go-ipfs May 9, 2019
@Stebalien Stebalien mentioned this pull request Jul 3, 2019