Skip to Main Content
Shape the future of IBM watsonx Orchestrate

Start by searching and reviewing ideas others have posted, and add a comment (private if needed), vote, or subscribe to updates on them if they matter to you.

If you can't find what you are looking for, create a new idea:

  1. stick to one feature enhancement per idea

  2. add as much detail as possible, including use-case, examples & screenshots (put anything confidential in Hidden details field or a private comment)

  3. Explain business impact and timeline of project being affected

[For IBMers] Add customer/project name, details & timeline in Hidden details field or a private comment (only visible to you and the IBM product team).

This all helps to scope and prioritize your idea among many other good ones. Thank you for your feedback!

Specific links you will want to bookmark for future use
Learn more about IBM watsonx Orchestrate - Use this site to find out additional information and details about the product.
Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.
IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.
ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

Status Needs more information
Created by Guest
Created on Sep 3, 2025

Instead of 504 errors, concise errors should be surfaced to the user, instead of having to dig through CP4D/vLLM logs

"Instead of 504 errors, if it's something surfaced from vLLM like:

2025-08-22T00:41:31.362506975Z ERROR 08-22 00:41:31 [serving_chat.py:200] ValueError: This model's maximum context length is 128000 tokens. However, you requested 844373 tokens (840277 in the messages, 4096 in the completion). 

Please reduce the length of the messages or completion have meaningful error messages visible to the client so they don't have to open a support ticket to have someone triage the logs.


Business Impact

Improves client experience: Users can immediately understand and resolve errors without relying on IBM support.
Reduces support burden: Fewer L1/L2 tickets for basic error triage, freeing up IBM resources.
Accelerates adoption: Faster troubleshooting builds user confidence in watsonx/CP4D as a reliable platform.
Enhances transparency: Aligns with enterprise expectations for usable and trustworthy AI services.
Competitive Differentiation: Many cloud providers already surface actionable model errors; providing this helps IBM keep pace and exceed expectations.

Urgency

With enterprise clients (e.g., Verizon) actively deploying LLM workloads, delays caused by opaque 504 errors introduce frustration and loss of productivity. Addressing this gap will have an immediate positive impact on client trust and ongoing adoption.

Idea priority Medium
  • Admin
    Laurent Tillette de Clermont-Tonnerre
    Sep 4, 2025