
- YOLO(You Only Look Once)

1. Devide the cell to the grid evenly
2. The grid cell which has the center of the object is assigned to detect that object(two red dots on the rightmost pic).

3. Each grid cell predict the bounding box and confidence score of bounding box. (Higher confidence score, thicker bounding box (The picture above (4)) ).
4. At the same time, classification is carried out simultaneously.
5. Leaves thick boxes only and erases thin boxes, which are less likely to have objects.
6. NMS(Non-Maximum Suppression)
- Confidence Score
The number that whether the bounding box contains objects or not, and how accurately the box predicts the ground truth box.
If there is an object in the grid cell, the confidence score is equal to the IoU value.
Conversely, if there is no object in the grid cell, the confidence score will be zero.
- Predictions on a grid
Bounding boxes are similar to anchor boxes in Faster RCNN.
They consists in five prediction-4 spatial coordinates, one objectness score.
A single cell contain only one object.

- Which class the object belongs to
- The likelihood that a grid cell contains an object
- Four bounding box descriptors to describe the
Example
- First bounding box


1. Devide the cell to 4*4 grid evenly
2. Each cell has 2 bounding boxes
3. Assume that there are 20 classes
4. Output(5 values)
- x, y : Center coordinate of predicted bounding box
- w, h : Normarlized width, height
- Pc : Confidence score
- Second bounding box.

Put the 5 values into an one-dimensional tensor.

The full output of applying

- Output tensor of 2 bounding boxes

- Output tensor of multiple bounding boxes

- Training Stage


1) Responsible cell(sky blue cell) for a specific object is assigned to the cell where the center coordinates of the ground truth box are located.
2) In the sky blue cell, select one bounding box based on the IOU value.
3) Calculate loss function between selected box(from 2)) and ground truth box.
- SSE(Sum of Squared Error)
Loss function for YOLO v1, sum of Localization loss, Confidence loss, Classification loss.
- Localization loss

-
-
-
-
-
-
- Confidence loss

-
-
-
-
- Classification loss

-
-
https://dotiromoook.tistory.com/24
'Deep Learning > Object Detection' 카테고리의 다른 글
Image Intensity, Canny (0) | 2024.05.28 |
---|---|
Object Detection, mAP(The Mean Average Precision) (0) | 2024.02.16 |
(prerequisite-YOLO) One Stage Object Detection (0) | 2024.02.11 |
(prerequisite-YOLO) DarkNet (0) | 2024.02.11 |
Faster RCNN (0) | 2024.01.30 |