當前位置：首頁 > 编程语言 > python >内容正文

python

yolov3 python_Python 3 Keras YOLO v3解析与实现

發布時間：2023/12/15 python 26 豆豆

生活随笔收集整理的這篇文章主要介紹了 yolov3 python_Python 3 Keras YOLO v3解析与实现小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

YOLOv3在YOLOv2的基礎進行了一些改進，這些更改使其效果變得更好。在320×320的圖像上，YOLOv3運行速度達到了22.2毫秒，mAP為28.2。其與SSD一樣準確，但速度快了三倍，具體效果如下圖。本文對YOLO v3的改進點進行了總結，并實現了一個基于Keras的YOLOv3檢測模型。

inference

環境

Python 3.6

Tensorflow-gpu 1.5.0

Keras 2.1.3

OpenCV 3.4

改進點

1.Darknet-53特征提取網絡

不同于Darknet-19，YOLO v3中使用了一個53層的卷積網絡，這個網絡由殘差單元疊加而成。根據作者的實驗，在分類準確度上跟效率的平衡上，這個模型比ResNet-101、 ResNet-152和Darknet-19表現得更好。

Darknet-53

2.邊界框預測

基本的坐標偏移公式與YOLO v2相同。

box

YOLO v3使用邏輯回歸預測每個邊界框的分數。如果先驗邊界框與真實框的重疊度比之前的任何其他邊界框都要好，則該值應該為1。如果先驗邊界框不是最好的，但確實與真實對象的重疊超過某個閾值(這里是0.5)，那么就忽略這次預測。YOLO v3只為每個真實對象分配一個邊界框，如果先驗邊界框與真實對象不吻合，則不會產生坐標或類別預測損失，只會產生物體預測損失。

3.類別預測

為了實現多標簽分類，模型不再使用softmax函數作為最終的分類器，而是使用logistic作為分類器，使用 binary cross-entropy作為損失函數。

4.多尺度預測

不同于之前的YOLO，YOLO v3從三種不同尺度的特征圖譜上進行預測任務。

在Darknet-53得到的特征圖的基礎上，經過7個卷積得到第一個特征圖譜，在這個特征圖譜上做第一次預測。

然后從后向前獲得倒數第3個卷積層的輸出，進行一次卷積一次x2上采樣，將上采樣特征與第43個卷積特征連接，經過7個卷積得到第二個特征圖譜，在這個特征圖譜上做第二次預測。

然后從后向前獲得倒數第3個卷積層的輸出，進行一次卷積一次x2上采樣，將上采樣特征與第26個卷積特征連接，經過7個卷積得到第三個特征圖譜，在這個特征圖譜上做第三次預測。

每個預測任務得到的特征大小都為N ×N ×[3?(4+1+80)] ，N為格子大小，3為每個格子得到的邊界框數量， 4是邊界框坐標數量，1是目標預測值，80是類別數量。

out

實驗

實現了一個輸入大小為(416, 416)的yolo v3檢測模型，模型使用了coco訓練的權值文件。

權值文件轉換

參考了yad2k項目的轉換方法，我們為其添加了幾個新的層，用來將Darknet的網絡結構和權值文件轉換為keras 2的網絡結構和權值文件。

執行下列命令轉換

python yad2k.py cfg\yolo.cfg yolov3.weights data\yolo.h5

檢測

demo.py文件提供了使用yolo v3進行檢測的例子。圖片檢測結果輸出到images\res文件夾。

"""Demo for use yolo v3

"""

import os

import time

import cv2

import numpy as np

from model.yolo_model import YOLO

def process_image(img):

"""Resize, reduce and expand image.

# Argument:

img: original image.

# Returns

image: ndarray(64, 64, 3), processed image.

"""

image = cv2.resize(img, (416, 416),

interpolation=cv2.INTER_CUBIC)

image = np.array(image, dtype='float32')

image /= 255.

image = np.expand_dims(image, axis=0)

return image

def get_classes(file):

"""Get classes name.

# Argument:

file: classes name for database.

# Returns

class_names: List, classes name.

"""

with open(file) as f:

class_names = f.readlines()

class_names = [c.strip() for c in class_names]

return class_names

def draw(image, boxes, scores, classes, all_classes):

"""Draw the boxes on the image.

# Argument:

image: original image.

boxes: ndarray, boxes of objects.

classes: ndarray, classes of objects.

scores: ndarray, scores of objects.

all_classes: all classes name.

"""

for box, score, cl in zip(boxes, scores, classes):

x, y, w, h = box

top = max(0, np.floor(x + 0.5).astype(int))

left = max(0, np.floor(y + 0.5).astype(int))

right = min(image.shape[1], np.floor(x + w + 0.5).astype(int))

bottom = min(image.shape[0], np.floor(y + h + 0.5).astype(int))

cv2.rectangle(image, (top, left), (right, bottom), (255, 0, 0), 2)

cv2.putText(image, '{0} {1:.2f}'.format(all_classes[cl], score),

(top, left - 6),

cv2.FONT_HERSHEY_SIMPLEX,

0.6, (0, 0, 255), 1,

cv2.LINE_AA)

print('class: {0}, score: {1:.2f}'.format(all_classes[cl], score))

print('box coordinate x,y,w,h: {0}'.format(box))

print()

def detect_image(image, yolo, all_classes):

"""Use yolo v3 to detect images.

# Argument:

image: original image.

yolo: YOLO, yolo model.

all_classes: all classes name.

# Returns:

image: processed image.

"""

pimage = process_image(image)

start = time.time()

boxes, classes, scores = yolo.predict(pimage, image.shape)

end = time.time()

print('time: {0:.2f}s'.format(end - start))

if boxes is not None:

draw(image, boxes, scores, classes, all_classes)

return image

def detect_vedio(video, yolo, all_classes):

"""Use yolo v3 to detect video.

# Argument:

video: video file.

yolo: YOLO, yolo model.

all_classes: all classes name.

"""

camera = cv2.VideoCapture(video)

cv2.namedWindow("detection", cv2.WINDOW_NORMAL)

while True:

res, frame = camera.read()

if not res:

break

image = detect_image(frame, yolo, all_classes)

cv2.imshow("detection", image)

if cv2.waitKey(110) & 0xff == 27:

break

camera.release()

if __name__ == '__main__':

yolo = YOLO(0.6, 0.5)

file = 'data/coco_classes.txt'

all_classes = get_classes(file)

# detect images in test floder.

for (root, dirs, files) in os.walk('images/test'):

if files:

for f in files:

print(f)

path = os.path.join(root, f)

image = cv2.imread(path)

image = detect_image(image, yolo, all_classes)

cv2.imwrite('images/res/' + f, image)

# detect vedio.

video = 'E:/video/car.flv'

detect_vedio(video, yolo, all_classes)

結果

運行python demo.py

dog.png

eagle.png

giraffe.png

horses.png

person.png

總結

以上是生活随笔為你收集整理的yolov3 python_Python 3 Keras YOLO v3解析与实现的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇：电脑nisec用户管理工具有什么作用
下一篇： AE错误代码解释