Add support for loading segmentation datasets in Pascal VOC format #245

kirilllzaitsev · 2023-07-26T08:25:02Z

Description

Add the ability to load Pascal VOC segmentation masks in addition to object detection annotations (related issue).
Changes were made to core.DetectionDataset.from_pascal_voc and formats.pascal_voc.load_pascal_voc_annotations.
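
For readers unfamiliar with how polygons can appear in a Pascal VOC object, here is a minimal sketch of reading them with the standard library. It assumes the polygon is stored as alternating x1, y1, x2, y2, ... child elements of a polygon tag; that layout, the sample XML, and the helper name parse_polygon are illustrative assumptions, not the code from this PR.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

# Hypothetical sample object; the <polygon> layout is an assumption.
SAMPLE = (
    "<object><name>dog</name><polygon>"
    "<x1>10</x1><y1>20</y1><x2>30</x2><y2>20</y2><x3>30</x3><y3>40</y3>"
    "</polygon></object>"
)


def parse_polygon(obj: ET.Element) -> List[Tuple[int, int]]:
    """Return (x, y) pairs from an <object>, or [] when it has no polygon."""
    polygon = obj.find("polygon")
    if polygon is None:
        return []
    # Children are visited in document order: x1, y1, x2, y2, ...
    values = [int(float(child.text)) for child in polygon]
    return list(zip(values[0::2], values[1::2]))


points = parse_polygon(ET.fromstring(SAMPLE))
no_points = parse_polygon(ET.fromstring("<object><name>cat</name></object>"))
```

An object without a polygon yields an empty list, so a caller can decide separately whether missing masks are an error.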

Type of change

Please delete options that are not relevant.

[+] New feature (non-breaking change which adds functionality)

How has this change been tested, please provide a testcase or example of how you tested the change?

Via new tests.

Any specific deployment considerations

For example, documentation changes, usability, usage/costs, secrets, etc.

Docs

Docs updated? What were the changes:

kirilllzaitsev · 2023-07-26T08:31:58Z

@SkalskiP , new implementation is not compatible with v1 of load_pascal_voc_annotations, but matches the loading standard of YOLO and COCO. Shall we temporarily leave the v1 there and issue a warning, or change the implementation to be backward compatible? Is there a general rule that we follow for such cases?

SkalskiP · 2023-07-28T13:46:16Z

> new implementation is not compatible with v1 of load_pascal_voc_annotations, but matches the loading standard of YOLO and COCO. Shall we temporarily leave the v1 there and issue a warning, or change the implementation to be backward compatible? Is there a general rule that we follow for such cases?

@kirilllzaitsev don't worry about breaking old load_pascal_voc_annotations. It is a function that is not publicly exposed.

kirilllzaitsev · 2023-07-29T21:40:15Z

@SkalskiP , hi, ready for review.

SkalskiP · 2023-07-31T10:41:18Z

Hi, @kirilllzaitsev 👋🏻! I've done some tests, and I see some problems:

from_pascal_voc loads mask data even if force_masks=False.
mask.shape has the wrong axis order. It is (N, W, H). It should be (N, H, W).
mask.dtype is incorrect. It is uint8. It should be bool.

Here is a Google Colab you can use to verify my findings: https://colab.research.google.com/drive/18rDUnAwPxhMt9YVbnFof5MS6__59NL11?usp=sharing

I'm not sure yet what to do with force_masks. I see there is inconsistency across different format loaders.
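
The shape and dtype points above can be checked with a small NumPy sketch. It assumes the masks arrive as a single (N, W, H) uint8 array, as reported; normalize_masks is a hypothetical helper name, not the fix that landed in the PR.

```python
import numpy as np


def normalize_masks(masks: np.ndarray) -> np.ndarray:
    """Convert (N, W, H) integer masks into (N, H, W) boolean masks."""
    if masks.ndim != 3:
        raise ValueError(f"expected a 3D array, got {masks.ndim}D")
    # Swap the last two axes, then cast non-zero values to True.
    return np.transpose(masks, (0, 2, 1)).astype(bool)


# Two masks for a 640 (W) x 480 (H) image, stored in the wrong axis order.
masks_nwh = np.zeros((2, 640, 480), dtype=np.uint8)
masks_nwh[0, 10:20, 30:40] = 1  # x in [10, 20), y in [30, 40)

masks = normalize_masks(masks_nwh)
```

After the conversion the first index after N addresses rows (y), so the same region is found at masks[0, 30:40, 10:20].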

SkalskiP · 2023-07-31T11:16:58Z

@kirilllzaitsev let me know when you'll be ready ;)

kirilllzaitsev · 2023-07-31T11:22:23Z

> Hi, @kirilllzaitsev 👋🏻! I've done some tests, and I see some problems:
>
> from_pascal_voc loads mask data even if force_masks=False.
>
> mask.shape has the wrong axis order. It is (N, W, H). It should be (N, H, W).
>
> mask.dtype is incorrect. It is uint8. It should be bool.
>
> Here is a Google Colab you can use to verify my findings: https://colab.research.google.com/drive/18rDUnAwPxhMt9YVbnFof5MS6__59NL11?usp=sharing
>
> I'm not sure yet what to do with force_masks. I see there is inconsistency across different format loaders.

To me, setting the force_masks flag translates into 'masks are required, and if there are no masks - raise an error'. This is what the YOLO loader does, contrary to the COCO that uses force_masks as an indicator of whether to use masks or not. The latter seems confusing to me.
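
The two readings of force_masks described here can be put side by side in a short sketch. Both functions are hypothetical illustrations of the semantics discussed in this thread; they are not the actual YOLO or COCO loader code.

```python
from typing import List, Optional, Tuple

Polygon = List[Tuple[int, int]]


def masks_required(polygons: Optional[List[Polygon]], force_masks: bool):
    """'Required' reading: force_masks=True means missing masks are an error."""
    if force_masks and not polygons:
        raise ValueError("force_masks=True, but the annotation has no polygons")
    # Masks are still returned when present, even if force_masks=False.
    return polygons or None


def masks_toggle(polygons: Optional[List[Polygon]], force_masks: bool):
    """'Toggle' reading: force_masks decides whether masks are used at all."""
    return polygons if (force_masks and polygons) else None


triangle = [[(0, 0), (4, 0), (0, 4)]]
required_result = masks_required(triangle, force_masks=False)
toggle_result = masks_toggle(triangle, force_masks=False)
```

With the same input and force_masks=False, the first reading still returns the polygons while the second drops them, which is the inconsistency under discussion.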

kirilllzaitsev · 2023-07-31T11:23:19Z

@SkalskiP , ready

SkalskiP · 2023-07-31T12:32:49Z

> To me, setting the force_masks flag translates into 'masks are required, and if there are no masks - raise an error'. This is what the YOLO loader does, contrary to the COCO that uses force_masks as an indicator of whether to use masks or not. The latter seems confusing to me.

I think that our current API sucks. force_masks is completely non-intuitive. And of course, this is my fault. :) I'm just thinking about how to fix it.

Things we can do:

Keep force_masks but make this behavior consistent for all formats.
Have separate methods for detection and segmentation. For example, from_coco and from_coco_segmentation.

@hardikdava what do you think?

hardikdava · 2023-07-31T12:42:26Z

@SkalskiP, I completely agree with you. We can introduce sv.VisionTask as an argument to sv.DetectionDataset, which would scale better and let us standardize how task types are specified.

from enum import Enum


class VisionTask(Enum):
    CLASSIFICATION = 0
    OBJECT_DETECTION = 1
    ORIENTED_BOUNDING_BOX = 2
    INSTANCE_SEGMENTATION = 3
    KEYPOINTS_DETECTION = 4
    POSE_ESTIMATION = 5

what do you think?
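
To make the proposal concrete, here is a rough sketch of how a loader might branch on such an enum instead of a boolean flag. Everything in it is an assumption for illustration: the trimmed-down VisionTask, the select_fields helper, and the dictionary layout are hypothetical and not part of supervision.

```python
from enum import Enum
from typing import Dict, Optional


class VisionTask(Enum):
    # Subset of the proposed enum, repeated so the sketch runs on its own.
    OBJECT_DETECTION = 1
    INSTANCE_SEGMENTATION = 3


def select_fields(annotation: Dict[str, Optional[list]], task: VisionTask) -> Dict[str, list]:
    """Pick the annotation fields a given task needs (hypothetical helper)."""
    boxes = annotation.get("boxes") or []
    if task is VisionTask.OBJECT_DETECTION:
        return {"boxes": boxes}
    if task is VisionTask.INSTANCE_SEGMENTATION:
        polygons = annotation.get("polygons")
        if not polygons:
            raise ValueError("instance segmentation requested, but no polygons found")
        return {"boxes": boxes, "polygons": polygons}
    raise NotImplementedError(f"unsupported task: {task}")


annotation = {"boxes": [[0, 0, 4, 4]], "polygons": [[(0, 0), (4, 0), (0, 4)]]}
detection_fields = select_fields(annotation, VisionTask.OBJECT_DETECTION)
segmentation_fields = select_fields(annotation, VisionTask.INSTANCE_SEGMENTATION)
```

The task value states the intent directly, so the question of whether a flag means "required" or "enabled" does not arise.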

SkalskiP · 2023-07-31T15:37:18Z

test/dataset/formats/test_pascal_voc.py

+):
+    with exception:
+        result = object_to_pascal_voc(xyxy=xyxy, name=name, polygon=polygon)
+        with open("/tmp/test.xml", "w") as f:


@kirilllzaitsev, what are we testing here? 👇🏻

with open("/tmp/test.xml", "w") as f:
    f.write(ET.tostring(result).decode())
with open("/tmp/exptest.xml", "w") as f:
    f.write(ET.tostring(expected_result).decode())

To be honest, I'd drop those four lines.

An artifact from local tests, apologies
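
For comparison, an XML result can be checked entirely in memory, with no files written to /tmp. The sketch below assumes a simple object/bndbox layout; build_object is a hypothetical stand-in and not the object_to_pascal_voc function under test.

```python
import xml.etree.ElementTree as ET
from typing import Sequence


def build_object(name: str, xyxy: Sequence[int]) -> ET.Element:
    """Build a minimal <object> element (hypothetical stand-in for the real converter)."""
    obj = ET.Element("object")
    ET.SubElement(obj, "name").text = name
    bndbox = ET.SubElement(obj, "bndbox")
    for tag, value in zip(("xmin", "ymin", "xmax", "ymax"), xyxy):
        ET.SubElement(bndbox, tag).text = str(value)
    return obj


result = build_object("dog", (1, 2, 3, 4))
expected = ET.fromstring(
    "<object><name>dog</name><bndbox>"
    "<xmin>1</xmin><ymin>2</ymin><xmax>3</xmax><ymax>4</ymax>"
    "</bndbox></object>"
)

# Compare the serialized forms directly instead of dumping them to disk.
same = ET.tostring(result) == ET.tostring(expected)
```

A string comparison like this is sensitive to whitespace and attribute order, so it suits small, hand-built expected elements.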

SkalskiP · 2023-07-31T15:47:34Z

supervision/dataset/formats/pascal_voc.py

+    return polygon_points
+
+
+def load_pascal_voc_annotations_v1(


Let's drop that logic altogether.

SkalskiP · 2023-07-31T15:50:10Z

@kirilllzaitsev I left a few more final comments. But we are definitely on the right path.

Also, please make sure to run 👇🏻 before the final commit.

isort --profile black supervision/
black supervision

kirilllzaitsev · 2023-07-31T18:12:46Z

@SkalskiP ready

SkalskiP · 2023-07-31T22:32:59Z

Looks good to me. Merging! @kirilllzaitsev you plan to work on anything more in this release?

kirilllzaitsev · 2023-08-01T06:20:24Z

@SkalskiP sure, what else is on the plate?

SkalskiP · 2023-08-01T09:00:30Z

@kirilllzaitsev, how about splitting DetectionDataset into DetectionDataset and SegmentationDataset POC? #244

kirilllzaitsev added 2 commits July 26, 2023 10:20

add load_pascal_voc_annotations v2

d7e08d4

update from_pascal_voc to match v2 loader

3723eae

kirilllzaitsev added 3 commits July 26, 2023 10:39

add test_pascal template

86b33ed

update docstrings

a911183

move fixing of class_ids to load_pascal_voc_annotations

6dde70e

mayankagarwals reviewed Jul 26, 2023

import polygon_to_mask from supervision

277a9cb

SkalskiP requested changes Jul 28, 2023

kirilllzaitsev added 9 commits July 28, 2023 22:21

fixes patch

b0e0d07

fix with_masks defined but not used

0a1212f

add test_object_to_pascal_voc

a181826

fix registering of empty detection

446ae23

refactor class_id assignment to Detections in VOC

9a9d0c5

remove test_detections_to_pascal_voc. conversion to XML is tested, and approx of masks should be in another test suite

1febd71

add test_parse_polygon_points

1642432

add test_detections_from_xml_obj

e4ef57e

add docstrings

f8005b2

kirilllzaitsev added 2 commits July 29, 2023 23:41

cleanup imports

e8f616b

upd docstring

12b3def

SkalskiP requested changes Jul 31, 2023

kirilllzaitsev added 3 commits July 31, 2023 13:09

fix mask shape (N, W, H) -> (N, H, W)

95c8576

fix mask dtype

77c2afe

cast masks to bool only when creating a Detection obj

9462a30

SkalskiP assigned kirilllzaitsev Jul 31, 2023

SkalskiP added the enhancement, api:datasets, and version: 0.13.0 labels Jul 31, 2023

SkalskiP added this to the version: 0.13.0 milestone Jul 31, 2023

lint

c0c292e

SkalskiP requested changes Jul 31, 2023

kirilllzaitsev added 3 commits July 31, 2023 19:42

remove artifacts

e2d1fe7

drop load_pascal_voc_annotations_v1

26289e3

extend tests for test_detections_from_xml_obj

6fdabf0

SkalskiP approved these changes Jul 31, 2023

SkalskiP merged commit d6d4760 into roboflow:develop Jul 31, 2023
4 checks passed

mayankagarwals mentioned this pull request Aug 2, 2023

Fix Pascal VOC Offset #235

Merged

3 tasks

SkalskiP mentioned this pull request Aug 16, 2023

Add support for loading segmentation datasets in Pascal VOC format #244

Closed

1 task
