The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide consistently better answers.
Recent activity on o1-pro
Total usage per day on OpenRouter
Reasoning
261K
Prompt
112K
Completion
25K
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.