# Preparing & Uploading Data

### Overview

A good place to start when creating and training a new model is with a dataset containing at least **500 images**. Include diverse lighting, angles, and contexts for real-world applicability. The dataset should also include **negative images**. Negative images do not include the target object, and are helpful to reduce false positives in your trained model. With a beginning dataset of 500 images, aim for 400 images (or 80%) showing your object, and 100 (20%) negative images.

{% hint style="info" %}

### (Classification / Auto-Tag Models Only)&#x20;

### Tip: Prelabel Images by Folder Name

If your images are organized into folders named after their class labels (e.g., `cat/`, `dog/`, `car/`), you can **drag and drop the folders directly** into the upload interface to have the images **automatically pre-tagged** with the corresponding class.

For example:

```
training_images/
├── cat/
│   ├── image1.jpg
│   └── image2.jpg
├── dog/
│   └── image3.jpg
```

When you upload this structure:

* All images inside `cat/` will be automatically tagged with the `"cat"` class
* All images inside `dog/` will be tagged with `"dog"`

This saves time and ensures consistent labeling during dataset creation.

**Note:** This feature works best when folder names exactly match your intended class names.
{% endhint %}

### What Are Negative Images?

**Negative images** are photos that do NOT contain your target object. These images are crucial for training your model to distinguish between what you want to detect and what you don't.

#### Why Use Negative Images?

* **Reduce false positives**: Help your model learn what NOT to classify as your target object
* **Improve accuracy**: Train the model to be more discriminating
* **Handle similar objects**: Teach the model to distinguish between similar-looking items

#### Examples of Negative Images:

* If training a door detection model: Include images of windows, archways, or openings that are NOT doors
* If training a car detection model: Include images of trucks, motorcycles, or street scenes without cars
* If training a person detection model: Include images of mannequins, statues, or scenes without people

### How to Organize Your Dataset

#### Folder Structure

Organize your images into clear folder structures before uploading:

```
your-dataset/
├── positive-images/
│   ├── door_001.jpg
│   ├── door_002.jpg
│   └── ... (400 images of doors)
└── negatives/
    ├── window_001.jpg
    ├── archway_001.jpg
    └── ... (100 images without doors)
```

**Important**: The folder containing negative images must be named **"negatives"** (all lowercase, plural).

### How to Upload Negative Images

#### Step-by-Step Process:

1. **Prepare your negative images folder**
   * Create a folder named exactly `negatives` (lowercase, plural)
   * Add approximately 100 images that do NOT contain your target object
   * Ensure images show similar contexts or potentially confusing objects
2.

```
<figure><img src="/files/kWivRYC94cK0OMkAtbwx" alt=""><figcaption></figcaption></figure>
```

3. **Navigate to the upload interface**

   * Go to the **Models** tab in the sidebar
   * Find your Image Collection
   * Click the **three dots (...)** menu next to your dataset
   * Select **Upload**

   <figure><img src="/files/wz8D7GPvthqINP1C3cVf" alt="" width="375"><figcaption></figcaption></figure>
4. **Upload the negatives folder**

   * On the upload screen, locate the "add more images" tile

   <figure><img src="/files/fmZQGKavzDEnVEsAkANe" alt="" width="375"><figcaption></figcaption></figure>

   * **Drag and drop or upload** your entire `negatives` folder onto this tile
   * The system will automatically recognize this as negative training data

#### Upload Tips:

* You can drag and drop the entire folder at once
* The upload area will accept the folder and process all images inside
* Make sure your folder is named exactly "negatives" for proper recognition

### Best Practices for Negative Images

#### Quality Guidelines:

* **Similar contexts**: Use images from similar environments where your target object might appear
* **Potential confusers**: Include objects that might be mistaken for your target (like windows when training for doors)
* **Varied conditions**: Include different lighting, angles, and backgrounds
* **High quality**: Use clear, well-lit images similar to your positive examples

#### Common Mistakes to Avoid:

* Don't use completely unrelated images (random landscapes for a door model)
* Avoid low-quality or blurry negative images
* Don't include images that partially show your target object
* Don't forget to name the folder exactly "negatives"

### Recommended Dataset Ratios

| Dataset Size | Positive Images | Negative Images | Ratio |
| ------------ | --------------- | --------------- | ----- |
| 500 images   | 400 images      | 100 images      | 80/20 |
| 1000 images  | 800 images      | 200 images      | 80/20 |
| 2000+ images | 1600+ images    | 400+ images     | 80/20 |

### Frequently Asked Questions

#### Q: How do I upload negative images?

**A:** Create a folder named "negatives" with \~100 images that don't contain your target object. Go to Models → click the three dots (...) on your dataset → select Upload → drag and drop the "negatives" folder onto the "add more images" tile.

#### Q: What should negative images contain?

**A:** Images that are similar to where your target object appears, but without the target object itself. For example, if training a door model, include images of windows, archways, or wall sections that might be confused for doors.

#### Q: How many negative images do I need?

**A:** Start with about 20% of your total dataset as negative images. For a 500-image dataset, use approximately 100 negative images.

#### Q: Can I add negative images after initial training?

**A:** Yes, you can upload additional negative images to improve your model's performance and reduce false positives.

***


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.eyepop.ai/developer-documentation/self-service-training/how-to-train-a-model/preparing-and-uploading-data.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
