All existing /chat/completions endpoints now support asynchronous responses: just add the parameter async=true to your query. This feature is provided at no extra cost. We have also added a callback parameter to support webhooks; note that callback only works when async=true is also set.
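As a rough illustration of how the two parameters combine, here is a minimal Python sketch that builds the request URL. The host `api.example.com`, the helper name `build_async_url`, and passing both parameters in the query string with the webhook address as the callback value are all assumptions made for this example, not confirmed details of the API. The sketch only constructs the URL and sends nothing.

```python
from urllib.parse import urlencode

# Hypothetical base URL (assumption); substitute the real API host.
BASE_URL = "https://api.example.com/chat/completions"


def build_async_url(callback_url=None, use_async=True, base_url=BASE_URL):
    """Return the endpoint URL with async (and optionally callback) set.

    Assumes both parameters travel in the query string.
    """
    if callback_url is not None and not use_async:
        # Mirrors the rule stated above: callbacks require async=true.
        raise ValueError("callback requires async=true")
    params = {}
    if use_async:
        params["async"] = "true"
    if callback_url is not None:
        # Assumption: the callback value is the webhook URL to notify.
        params["callback"] = callback_url
    query = urlencode(params)  # percent-encodes the callback URL
    return f"{base_url}?{query}" if query else base_url


if __name__ == "__main__":
    print(build_async_url("https://my.example.org/hook"))
```

Rejecting a callback without async on the client side is a design choice for this sketch; how the server itself responds to that combination is not specified here.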