Vision Models Archives - Wayne's Talk

29/06/2025

82 views

In the field of image recognition, Convolutional Neural Networks (CNNs) have long been the dominant architecture. In recent years, Transformer models have achieved great success in Natural Language Processing (NLP), which has led researchers to consider applying the Transformer architecture to image processing tasks. Vision Transformer (ViT) is a model designed for image understanding based on the Transformer framework.

82 views
11 minute read

Vision Transformer Model

ByWayne
29/06/2025

86 views
5 minute read

CLIP Model

ByWayne
03/06/2025

CLIP (Contrastive Language-Image Pre-training) is a model proposed by OpenAI in 2021. It achieves strong generalization capability by integrating visual and language representations, and it has extensive potential applications. This article will introduce both the theory and practical implementation of CLIP.

242 views
8 minute read

Convolutional Neural Networks (CNN)

ByWayne
23/01/2025

Convolutional neural networks (CNN) is a computer vision and image processing method based on neural networks. In this article, we will introduce the principles of various layers in CNN.

2.5K views
5 minute read

Executing YOLOv8 Models on Android Using ONNX Runtime

ByWayne
19/06/2024

Open Neural Network Exchange (ONNX) is a model format defined by several major manufacturers. ONNX Runtime is a library that can execute ONNX models. It was developed by Microsoft. It supports multiple platforms, including Android.

1.6K views
4 minute read

Executing YOLOv8 Models on Android Using PyTorch

ByWayne
19/06/2024

PyTorch is a machine learning library developed by Meta. YOLOv8 also uses Pytorch internally. In addition to Python environments, we can now use PyTorch in non-Python environments.

1.7K views
4 minute read

Non Maximum Suppression (NMS)

ByWayne
29/05/2024

Non maximum suppression is a technique used in object detection to filter bounding boxes generated by object detection algorithms. If we don’t use NMS, we will get an image with dense frames.

1.3K views
5 minute read

YOLOv8 Object Detection Tutorial

ByWayne
23/05/2024

YOLO (You Only Look Once) is a popular object detection model. Its high performance and high accuracy made it popular quickly. This article will introduce how to use YOLOv8 for object detection.

Get source code of posts.

Vision Transformer Model

Vision Models

Vision Transformer Model

CLIP Model

Convolutional Neural Networks (CNN)

Executing YOLOv8 Models on Android Using ONNX Runtime

Executing YOLOv8 Models on Android Using PyTorch

Non Maximum Suppression (NMS)

YOLOv8 Object Detection Tutorial

Bradley-Terry Model

Entropy

Byte-Pair Encoding

Policy Gradient

On-Policy Control with Approximation

Spring Security JWT Authentication with Google Sign-In Explained

How to Backup and Restore MySQL Databases in Spring Boot

Sending Push Notifications Using FCM in Spring Boot

Python Pie/Donut/Sunburst Charts

Kotlin Coroutine Flow Tutorial

Spring Security JWT Authentication with Google Sign-In Explained

How to Backup and Restore MySQL Databases in Spring Boot

Sending Push Notifications Using FCM in Spring Boot

Python Pie/Donut/Sunburst Charts