This feature adds image recognition capabilities to all models. There are two ways to enable it; choose either one:
1. Append the `-ocr` suffix to any model name, for example `gpt-3.5-turbo-ocr` (convenient for third-party software).
2. Specify the OCR model in the model field when making the request, e.g. `ocr_model:gpt-4o-mini`, as shown in the example below (convenient for direct API calls).
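A minimal sketch of both activation styles, assuming an OpenAI-compatible `/v1/chat/completions` endpoint. The base URL and API key are placeholders, and exactly how the `ocr_model:` token combines with a target model name is an assumption based on the description above:

```python
import requests

API_BASE = "https://your-gateway.example.com"  # placeholder base URL
HEADERS = {"Authorization": "Bearer sk-..."}   # placeholder API key

# A user message carrying an image, in the standard OpenAI content-parts form.
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "What text is in this image?"},
        {"type": "image_url", "image_url": {"url": "https://example.com/photo.png"}},
    ],
}

# Method 1: append the -ocr suffix to the model name.
resp = requests.post(
    f"{API_BASE}/v1/chat/completions",
    headers=HEADERS,
    json={"model": "gpt-3.5-turbo-ocr", "messages": [message]},
)
print(resp.json()["choices"][0]["message"]["content"])

# Method 2: name the OCR model explicitly via the ocr_model: token.
# The documented token is "ocr_model:gpt-4o-mini"; how it is combined
# with the target model's name is gateway-specific and assumed here.
resp = requests.post(
    f"{API_BASE}/v1/chat/completions",
    headers=HEADERS,
    json={"model": "ocr_model:gpt-4o-mini", "messages": [message]},
)
```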
Note: if a multimodal model's name carries the `-ocr` suffix, it will also route images through the specified or default OCR model for analysis, so avoid enabling this feature on multimodal models.

How this works: before each request, the user's image is sent to the multimodal model for analysis, and the analysis result is then incorporated into the model context as reference information (see the sketch below). The specific process can be inspected in the logs during the API call. The default OCR model currently in use is gpt-4o-mini.

Image analysis prompt:
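A rough sketch of this two-step flow, assuming an OpenAI-compatible endpoint. The helper names, URLs, and the placeholder analysis prompt are illustrative assumptions, not the gateway's actual implementation:

```python
import requests

API_BASE = "https://your-gateway.example.com"  # placeholder base URL
HEADERS = {"Authorization": "Bearer sk-..."}   # placeholder API key
OCR_MODEL = "gpt-4o-mini"                      # the default OCR model mentioned above

def analyze_image(image_url: str) -> str:
    """Step 1: send the user's image to the multimodal (OCR) model.
    The prompt below is a placeholder; the service's actual image
    analysis prompt is not reproduced here."""
    resp = requests.post(
        f"{API_BASE}/v1/chat/completions",
        headers=HEADERS,
        json={
            "model": OCR_MODEL,
            "messages": [{
                "role": "user",
                "content": [
                    {"type": "text",
                     "text": "Describe this image in detail, including any text it contains."},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }],
        },
    )
    return resp.json()["choices"][0]["message"]["content"]

def ask_with_image(question: str, image_url: str,
                   target_model: str = "gpt-3.5-turbo") -> str:
    """Step 2: fold the analysis into the context as reference information,
    then let the text-only target model answer from that description."""
    analysis = analyze_image(image_url)
    resp = requests.post(
        f"{API_BASE}/v1/chat/completions",
        headers=HEADERS,
        json={
            "model": target_model,
            "messages": [
                {"role": "system", "content": f"Reference image analysis:\n{analysis}"},
                {"role": "user", "content": question},
            ],
        },
    )
    return resp.json()["choices"][0]["message"]["content"]
```

Because two model calls are made, the billed usage is the target model's tokens plus the OCR model's tokens, which matches the pricing rule below.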
Price: the original model's cost plus the cost of the multimodal (OCR) model call.
{"id":"chatcmpl-123","object":"chat.completion","created":1677652288,"choices":[{"index":0,"message":{"role":"assistant","content":"\n\nHello there, how may I assist you today?"},"finish_reason":"stop"}],"usage":{"prompt_tokens":9,"completion_tokens":12,"total_tokens":21}}