When it comes to utilizing AI for text generation, there is a noticeable difference between using a chat interface and an API. In my experience, the output produced by an AI model in a chat setting tends to be far superior compared to the output generated through an API provided by Make, even when using the same prompt and top-notch models.
Case Study: Editing HTML Files with Grok
Let's consider a specific case where I am working on editing and rewriting HTML files, particularly emails, using Grok. When I engage with Grok3 in a chat scenario, the results are truly impressive. However, when I switch to using the API with the Grok3-beta model, the HTML output I receive is often fragmented and incomplete. This pattern seems to hold true for other models like Deepseek as well, although I have not conducted a direct comparison with those.
It raises the question - has anyone else encountered this disparity and managed to find a solution to address it?

Thank you for your insights!
