{"id":137,"date":"2025-04-03T00:49:00","date_gmt":"2025-04-02T16:49:00","guid":{"rendered":"https:\/\/blog.liu-qi.cn\/?p=137"},"modified":"2026-04-18T21:51:38","modified_gmt":"2026-04-18T13:51:38","slug":"%e5%a6%82%e4%bd%95%e7%9c%8b%e5%be%85-gpt-4o-%e5%87%ba%e7%8e%b0%e5%90%8e%ef%bc%8ccomfyui-%e7%ad%89%e4%b8%80%e4%bc%97%e5%b7%a5%e4%bd%9c%e6%b5%81%e8%bd%af%e4%bb%b6%e4%bc%9a%e8%a2%ab%e6%b7%98%e6%b1%b0","status":"publish","type":"post","link":"https:\/\/en.blog.liu-qi.cn\/2025\/04\/03\/%e5%a6%82%e4%bd%95%e7%9c%8b%e5%be%85-gpt-4o-%e5%87%ba%e7%8e%b0%e5%90%8e%ef%bc%8ccomfyui-%e7%ad%89%e4%b8%80%e4%bc%97%e5%b7%a5%e4%bd%9c%e6%b5%81%e8%bd%af%e4%bb%b6%e4%bc%9a%e8%a2%ab%e6%b7%98%e6%b1%b0\/","title":{"rendered":"Why Workflow Tools Like ComfyUI Won't Be Replaced by GPT-4o Anytime Soon"},"content":{"rendered":"<p>I came across this question on Zhihu. After GPT-4o&#8217;s image generation became popular, some people say Photoshop will be replaced, while others say ComfyUI will be replaced.<\/p>\n<p>This kind of talk is essentially the same as saying Manus will replace Coze.<\/p>\n<p>As large models gain stronger capabilities, they will inevitably internalize workflow functionalities and the effectiveness of prompts\u2014this is something any observant person can see.<\/p>\n<p>What is the core of a workflow?<\/p>\n<p>The core of a workflow lies in weaving determinism.<\/p>\n<p>Nodes and plugins are not the core aspects; when a large model&#8217;s coding capabilities become strong enough, it can develop those things itself.<\/p>\n<p>What a workflow provides in the face of large models is determinism.<\/p>\n<p>What exactly does a large model&#8217;s output represent?<\/p>\n<p>It&#8217;s probability.<\/p>\n<p>When you ask a large model to draw a sky, even if you want a clear blue sky, the output might not be 100% blue sky.<\/p>\n<p>Because the sky is dark at night, red at dusk, and gray on rainy days\u2014these possibilities inherently exist. So at best, you can draw a blue sky most of the time, but not 100% guaranteed.<\/p>\n<p>For complex tasks, it&#8217;s even more so; with various possibilities intertwined, you can&#8217;t ensure the large model&#8217;s output for the same task is 100% consistent.<\/p>\n<p>And often, uncertain things cannot be used in production environments.<\/p>\n<p>I tell my colleagues that when making tables, use formulas whenever possible instead of AI fields\u2014this is the reasoning behind it.<\/p>\n<p>It&#8217;s even more energy-efficient and environmentally friendly.<\/p>\n<p>The same logic applies to workflows.<\/p>\n<p>Through a workflow, you can plan steps: first do this, then do that; first load a checkpoint, then load a LoRA\u2014this ensures there are no errors.<\/p>\n<p>For example, if you want to redraw a face, your workflow must first extract the face, then perform the redrawing\u2014even if a single node packages both functions, it still extracts the face first before redrawing. That&#8217;s how face redrawing works, right? Otherwise, wouldn&#8217;t you just redraw the entire image?<\/p>\n<p>Coincidentally, GPT-4o redraws the entire image.<\/p>\n<p>I sent it such an image, found on Baidu:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/04\/001-553720454c54.png\" \/><\/p>\n<p>I asked it to remove the tattoo from this girl, and it returned this image.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/04\/002-ff77c9349c50.png\" \/><\/p>\n<p>The tattoo was indeed removed.<\/p>\n<p>But it&#8217;s not &#8216;this girl&#8217;.<\/p>\n<p>Looking closely at the comparison, the redrawn image is really, really similar.<\/p>\n<p>But not only has the girl changed.<\/p>\n<p>The bag on the table became a hat, the polygonal glass cup in her hand became a round cup, the shoes on her feet changed, the hand posture and sitting position changed, and the curtain texture also changed&#8230;<\/p>\n<p>The entire image is completely different.<\/p>\n<p>Where is the determinism?<\/p>\n<p>If this girl is my client, paying me to remove her tattoo, would she accept this kind of image?<\/p>\n<p>Let&#8217;s give another example.<\/p>\n<p>When Robert Downey Jr. officially announced his role as Doctor Doom, he wore a mask.<\/p>\n<p>Here is a photo of him with the mask:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/04\/003-3125d93d49a2.png\" \/><\/p>\n<p>Now I want to remove this mask using a ComfyUI workflow:<\/p>\n<p>Extract the mask, then remove it.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/04\/004-f201981a6bb5.png\" \/><\/p>\n<p>The mask is removed, and everything else remains unchanged.<\/p>\n<p>Now let&#8217;s use 4o to do it:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/04\/005-3c1fc4bbbb44.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/04\/006-45feb7e19a29.png\" \/><\/p>\n<p>The details become richer, more textured, and even the person looks younger\u2014how wonderful.<\/p>\n<p>But look closely: the eye color changed, the hand lowered, the robe turned into a hoodie, and the thing around the neck looks like a stethoscope.<\/p>\n<p>This image still clearly shows it&#8217;s Robert Downey Jr.; you can post it on Twitter or social media without issue, but it cannot be used seriously. Wearing a hoodie, you could say he&#8217;s Iron Man, but you can&#8217;t say he&#8217;s Doctor Doom.<\/p>\n<p>So, what does a workflow bring? It allows you to target precisely.<\/p>\n<p>If you want to change A, change A\u2014don&#8217;t mess with B, C, or D, even if it&#8217;s just a probability.<\/p>\n<p>Returning to Manus and Coze, it&#8217;s the same.<\/p>\n<p>Can AI automatically complete the workflows we painstakingly built? Yes, it can.<\/p>\n<p>But does Manus never make mistakes? Of course not; error cases are everywhere.<\/p>\n<p>As long as AI cannot independently complete sufficiently complex tasks, humans need to design workflows for it\u2014first this, then that\u2014to improve its task accuracy.<\/p>\n<p>Of course, it cannot be denied that large models are indeed internalizing many capabilities.<\/p>\n<p>For instance, prompts.<\/p>\n<p>When ChatGPT first came out, defining a role for the model\u2014&#8217;you are an expert in the XXX field&#8217;\u2014was usually very effective.<\/p>\n<p>The difference in results with and without role definition was significant.<\/p>\n<p>Using structured prompts yielded good results.<\/p>\n<p>So at that time, everyone thought prompts were crucial, a field of study, and Prompt Engineer would become a widespread job in the future.<\/p>\n<p>But later, things changed; many models performed well even without role definition. Why? Because large model developers aren&#8217;t stupid\u2014they noticed that prompts were highly effective, so why not train them directly into the model for it to determine the appropriate role itself?<\/p>\n<p>Later, reasoning models emerged.<\/p>\n<p>When DeepSeek-R1 became wildly popular, people realized they didn&#8217;t need to give large models complex, structured prompts to get decent responses. I didn&#8217;t need to become a Prompt Engineer to use AI happily.<\/p>\n<p>Of course, prompts still matter; good prompts still yield vastly different results compared to casually asked questions. But at least for simple tasks, there&#8217;s no longer a need for dedicated prompt engineers to write prompts. AI can think on its own, understand user intent, and provide the most appropriate responses.<\/p>\n<p>Prompt Engineers won&#8217;t disappear, but since simple tasks no longer require them, the barrier will certainly be higher, requiring more skills, and it won&#8217;t be as common.<\/p>\n<p>Let me repeat: large models are internalizing many capabilities that are currently attached to them as external tools.<\/p>\n<p>Because technology is advancing, not regressing.<\/p>\n<p>AI will ultimately lead to AGI\u2014this is a goal everyone knows.<\/p>\n<p>The uncertainty that workflows need to mitigate will become smaller and smaller\u2014this is an undeniable inference.<\/p>\n<p>As large models&#8217; capabilities grow, they are squeezing into this space.<\/p>\n<p>4o doesn&#8217;t support local redrawing now, but it might support it soon.<\/p>\n<p>AI couldn&#8217;t generate Chinese last year, but now it does quite well, doesn&#8217;t it?<\/p>\n<p>But saying it will eliminate workflows is still premature.<\/p>\n<p>If AGI is 100%, then until AGI is achieved, it&#8217;s naturally not 100%.<\/p>\n<p>So there will still be a need for manually crafted workflows.<\/p>\n<p>Learning is definitely not in vain; a crucial point is not to think in absolutes\u2014what is eliminated is far less important than embracing and integrating new things.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Despite GPT-4o&#8217;s advancements, workflow tools like ComfyUI remain essential for ensuring deterministic and controllable outputs in professional tasks.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24],"tags":[8,6,9],"class_list":["post-137","post","type-post","status-publish","format-standard","hentry","category-articles","tag-ai-","tag-comfyui","tag-prompt"],"_links":{"self":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts\/137","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/comments?post=137"}],"version-history":[{"count":0,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts\/137\/revisions"}],"wp:attachment":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/media?parent=137"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/categories?post=137"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/tags?post=137"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}