RoIHeads.postprocess_detections boxes slicing error occurs when removing predictions with the background label

### 🐛 Describe the bug

**Bug Report: Incorrect Box Slicing in Faster R-CNN's postprocess_detections**

### Minimal Reproduction Code
```python
import torch
import torchvision

detector = torchvision.models.detection.fasterrcnn_resnet50_fpn(pretrained=True)
data = torch.zeros((1, 3, 1080, 1920), dtype=torch.float32)
detections = detector(data)
```

### Description
The bug occurs in [`roi_heads.py` (line 701)](https://github.com/pytorch/vision/blob/main/torchvision/models/detection/roi_heads.py#L701) in the `postprocess_detections` function of `RoIHeads` when processing Faster R-CNN outputs. The current implementation incorrectly handles box dimension slicing when removing background class predictions.

### Problem Location
The problematic code segment:
```python
for boxes, scores, image_shape in zip(pred_boxes_list, pred_scores_list, image_shapes):
    ...
    # remove predictions with the background label
    boxes = boxes[:, 1:]  # Incorrect slicing
    scores = scores[:, 1:]
    labels = labels[:, 1:]
    ...
```

### Root Cause
1. The boxes tensor has shape `[N, num_classes * 4]` (where each class has 4 coordinate values)
2. The current slicing `boxes[:, 1:]` incorrectly operates on the last dimension (class*coordinates) instead of just the class dimension
3. This causes misalignment between boxes, scores, and labels since they're being sliced differently

![Image](https://github.com/user-attachments/assets/d1c0b97d-c873-469e-9cc6-5eb0a80e6765)

### Expected Behavior
The boxes tensor should first be reshaped to `[N, num_classes, 4]` before slicing to properly separate class and coordinate dimensions.

### Proposed Fix
```python
for boxes, scores, image_shape in zip(pred_boxes_list, pred_scores_list, image_shapes):
    ...
    # remove predictions with the background label
    boxes = boxes.reshape(-1, num_classes, 4)  # Proper dimension separation
    boxes = boxes[:, 1:, :]  # Correct class dimension slicing
    scores = scores[:, 1:]
    labels = labels[:, 1:]
    ...
```

### Impact
The current implementation leads to:
1. Misaligned boxes and their corresponding scores/labels
2. Potentially incorrect final detection results
3. Silent failure without explicit errors

### Versions

branch: 6473b779bdb8ba02bab0fc9e0f4ef4661ebb632a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RoIHeads.postprocess_detections boxes slicing error occurs when removing predictions with the background label #9110

🐛 Describe the bug

Minimal Reproduction Code

Description

Problem Location

Root Cause

Expected Behavior

Proposed Fix

Impact

Versions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

RoIHeads.postprocess_detections boxes slicing error occurs when removing predictions with the background label #9110

Description

🐛 Describe the bug

Minimal Reproduction Code

Description

Problem Location

Root Cause

Expected Behavior

Proposed Fix

Impact

Versions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions