Object Detection to Identify and Track Objects | Fritz AI

Object Detection

Use Object Detection to identify and track things within the contents of an image or each frame of live video. Enhances a range of experiences from photography to autonomy.

Getting Started

import Fritz

var objectModel: FritzVisionObjectModel?

let image = FritzVisionImage(image: uiImage)

objectModel.predict(image, options: options) { objects, error in
    if let objects = objects, objects.count > 0 {
        DispatchQueue.main.async {
            self.drawBoxes(predictions: objects)
Object Detection

The Swift code sample here illustrates how simple it can be to use object detection in your app. Use the links below to access additional documentation, code samples, and tutorials that will help you get started.


Recognizes 91 Objects

Bounding boxes provided for each object detected.

Our mobile-friendly model was trained on COCO, a large-scale object detection dataset. Predicts objects such as:

Model Variants

Fast: Optimized for speed, best for processing video streams in real-time or on older devices.

Accurate: Optimized for higher accuracy where prediction quality is more important than speed.

Small: Optimized for size, keep your application bundle size low and conserve bandwidth.

Runs On-Device

All predictions / model inferences are made completely on-device.

No internet connection is required to interpret images or video.

No internet dependency means super-fast performance.

Cross-Platform SDKs

Supported mobile platforms:

  • Android Object Detection
  • iOS Object Detection
Live Video Performance

Runs on live video with a fast frame rate.

Exact FPS performance varies depending on device, but it is possible to run this feature on live video on modern mobile devices.

Technical Specifications


SSDLite + MobileNet V2 variant

Model Size

~17 MB


800 M


300x300-pixel image


Offsets for >2,000 candidate bounding boxes

Class labels for each box

Confidence scores for each box


Core ML, TensorFlow, TensorFlow Lite, TensorFlow Mobile, Keras


18 FPS on iPhone X

8 FPS on Pixel 2