YOLOv5 Analysis (2) - Train Custom Data

dhpark 2022. 2. 3. 00:24

Source: https://docs.ultralytics.com/tutorials/train-custom-datasets/

Following the page above, this post briefly walks through how to train YOLOv5 on custom data.

Train Custom Data

1. Create dataset.yaml

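COCO128 is a small subset of COCO train2017. As the file below shows, the same images are used for both training and validation, so it is well suited to checking that the training pipeline runs correctly (it is not meant for real training).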

The data/coco128.yaml file:

# train and val data as 1) directory: path/images/, 2) file: path/images.txt, or 3) list: [path1/images/, path2/images/]
train: ../coco128/images/train2017/
val: ../coco128/images/train2017/

# number of classes
nc: 80

# class names
names: ['person', 'bicycle', 'car', 'motorcycle', 'airplane', 'bus', 'train', 'truck', 'boat', 'traffic light',
        'fire hydrant', 'stop sign', 'parking meter', 'bench', 'bird', 'cat', 'dog', 'horse', 'sheep', 'cow',
        'elephant', 'bear', 'zebra', 'giraffe', 'backpack', 'umbrella', 'handbag', 'tie', 'suitcase', 'frisbee',
        'skis', 'snowboard', 'sports ball', 'kite', 'baseball bat', 'baseball glove', 'skateboard', 'surfboard',
        'tennis racket', 'bottle', 'wine glass', 'cup', 'fork', 'knife', 'spoon', 'bowl', 'banana', 'apple',
        'sandwich', 'orange', 'broccoli', 'carrot', 'hot dog', 'pizza', 'donut', 'cake', 'chair', 'couch',
        'potted plant', 'bed', 'dining table', 'toilet', 'tv', 'laptop', 'mouse', 'remote', 'keyboard', 
        'cell phone', 'microwave', 'oven', 'toaster', 'sink', 'refrigerator', 'book', 'clock', 'vase', 'scissors', 
        'teddy bear', 'hair drier', 'toothbrush']
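For a custom dataset, the same three fields (train/val paths, nc, names) have to be filled in, and nc must match the number of entries in names. Here is a minimal sketch that generates such a file as text. The helper name build_dataset_yaml and the helmet paths and class names are made up for illustration and are not part of YOLOv5. It also assumes class names contain no apostrophes.

```python
def build_dataset_yaml(train_dir, val_dir, class_names):
    """Return the text of a minimal YOLOv5-style dataset YAML (hypothetical helper)."""
    if not class_names:
        raise ValueError("at least one class name is required")
    # Quote each name the same way coco128.yaml does.
    quoted = ", ".join(f"'{name}'" for name in class_names)
    lines = [
        f"train: {train_dir}",
        f"val: {val_dir}",
        "",
        "# number of classes (derived from names so the two cannot disagree)",
        f"nc: {len(class_names)}",
        "",
        "# class names, in class-index order",
        f"names: [{quoted}]",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Example paths and classes are placeholders for your own dataset.
    print(build_dataset_yaml("../helmets/images/train/",
                             "../helmets/images/val/",
                             ["helmet", "no_helmet"]))
```

Deriving nc from the list avoids the easy mistake of editing names and forgetting to update the count.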

2. Create Labels

Prepare the labels in YOLO format, with one *.txt file per image. If an image contains no objects, no *.txt file is needed for it.

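Format details

- Each row describes one object (not one image).

- Each row has the form class x_center y_center width height, separated by spaces, where class is a zero-based index into names.

- Coordinates are normalized to values between 0 and 1. If your boxes are in pixels, divide x_center and width by the image width, and y_center and height by the image height.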
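Annotation tools often export boxes as pixel corner coordinates, so a conversion step is usually needed. The sketch below turns one (x_min, y_min, x_max, y_max) pixel box into a YOLO label row. The function name to_yolo_line and the six-decimal formatting are my own choices, not something YOLOv5 prescribes.

```python
def to_yolo_line(class_id, x_min, y_min, x_max, y_max, img_w, img_h):
    """Convert a pixel-space corner box to one YOLO-format label row."""
    # Box center, normalized by the image size.
    x_center = (x_min + x_max) / 2 / img_w
    y_center = (y_min + y_max) / 2 / img_h
    # Box size, normalized by the image size.
    width = (x_max - x_min) / img_w
    height = (y_max - y_min) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


if __name__ == "__main__":
    # A 200x200 px box in a 640x480 image, class index 0.
    print(to_yolo_line(0, 100, 200, 300, 400, 640, 480))
    # prints: 0 0.312500 0.625000 0.312500 0.416667
```

Writing one such row per object into im0.txt gives the label file for im0.jpg.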

3. Organize Directories

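Arrange the images and labels in the folder structure below. YOLOv5 finds the label for each image automatically: it replaces the last /images/ in the image path with /labels/ and swaps the file extension for .txt (e.g. images/im1.jpg is paired with labels/im1.txt).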

|- yolov5
|- {dataset name}
    |- images/im0.jpg, ... imN.jpg
    |- labels/im0.txt, ... imN.txt
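The YOLOv5 docs describe the label lookup as replacing the last /images/ in each image path with /labels/. The sketch below reproduces that rule so you can check your own folder layout before training. It is an illustration, not the library's code, and image_to_label_path is a hypothetical name.

```python
import os


def image_to_label_path(image_path):
    """Map an image path to the label path expected by the /images/ -> /labels/ rule."""
    images_dir = os.sep + "images" + os.sep
    labels_dir = os.sep + "labels" + os.sep
    # rsplit(..., 1) touches only the LAST "/images/" in the path,
    # so a parent folder that happens to be named "images" is left alone.
    swapped = labels_dir.join(image_path.rsplit(images_dir, 1))
    # Drop the image extension and use .txt instead.
    return os.path.splitext(swapped)[0] + ".txt"


if __name__ == "__main__":
    sample = os.path.join("..", "coco128", "images", "train2017", "im0.jpg")
    print(image_to_label_path(sample))
```

If the printed path does not exist for an image that does contain objects, that image will be treated as having no labels.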

4. Select a Model

- Choose a model from the site that fits your purpose. In practice, this means picking one whose size fits within the memory available on the target controller.

5. Train

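- Download the pretrained weights, set options such as the batch size, and start training.

- Training from randomly initialized weights with a yolov5X.yaml config (X being the model size), instead of using pretrained weights, is not recommended.

(e.g. --weights '' --cfg yolov5s.yaml)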

# Train YOLOv5s on COCO128 for 5 epochs
$ python train.py --img 640 --batch 16 --epochs 5 --data coco128.yaml --weights yolov5s.pt

- Each run creates a new runs/train/expX folder, and the training results are saved there.
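Since every run gets its own folder, it can be handy to locate the most recent one from a script. The sketch below assumes the run folders are named exp, exp2, exp3, and so on under runs/train. That naming is an assumption worth checking against your own runs directory, and latest_run_dir is a hypothetical helper, not a YOLOv5 function.

```python
import re
from pathlib import Path


def latest_run_dir(train_root="runs/train"):
    """Return the expX folder with the highest index, or None if there is none."""
    root = Path(train_root)
    if not root.is_dir():
        return None
    runs = []
    for child in root.iterdir():
        # Assumed naming: "exp" for the first run, then "exp2", "exp3", ...
        match = re.fullmatch(r"exp(\d*)", child.name)
        if child.is_dir() and match:
            index = int(match.group(1) or 1)  # bare "exp" counts as run 1
            runs.append((index, child))
    # Compare numerically so that exp10 ranks above exp9.
    return max(runs, key=lambda item: item[0])[1] if runs else None


if __name__ == "__main__":
    print(latest_run_dir())
```

Sorting by the numeric suffix rather than by name matters once there are ten or more runs.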

6. Visualize

A Weights & Biases logging integration is provided, so you can follow training progress in real time in the cloud.

pip install wandb  # install

The results are of course also saved locally in the runs/train/expX folder. They are stored in TensorBoard format (plus a results.txt file), so you can open them in TensorBoard.

from utils.plots import plot_results 
plot_results(save_dir='runs/train/exp')  # plot results.txt as results.png

Training environments are also provided for AWS, GCP, and as a Docker image.