{"id":136,"date":"2025-03-29T22:56:00","date_gmt":"2025-03-29T14:56:00","guid":{"rendered":"https:\/\/blog.liu-qi.cn\/?p=136"},"modified":"2026-04-18T21:51:27","modified_gmt":"2026-04-18T13:51:27","slug":"gpt-4o%e5%86%8d%e5%8a%a0%e4%b8%8a%e8%bf%99%e5%bc%a0%e8%a1%a8%ef%bc%8c%e8%ae%a9ai%e8%87%aa%e5%8a%a8%e7%94%9f%e6%88%90%e5%b0%8f%e7%ba%a2%e4%b9%a6%e5%b0%81%e9%9d%a2%e5%9b%be","status":"publish","type":"post","link":"https:\/\/en.blog.liu-qi.cn\/2025\/03\/29\/gpt-4o%e5%86%8d%e5%8a%a0%e4%b8%8a%e8%bf%99%e5%bc%a0%e8%a1%a8%ef%bc%8c%e8%ae%a9ai%e8%87%aa%e5%8a%a8%e7%94%9f%e6%88%90%e5%b0%8f%e7%ba%a2%e4%b9%a6%e5%b0%81%e9%9d%a2%e5%9b%be\/","title":{"rendered":"GPT-4o and a Template for AI-Generated Xiaohongshu Cover Images"},"content":{"rendered":"<p>It&#8217;s not that I&#8217;m so immersed in AI that I forgot to update, but rather that AI is truly endless to explore\u2014literally endless.<\/p>\n<p>GPT-4o&#8217;s multimodal image generation has been live for over three days, with new uses emerging daily and endless creative ways to edit images with just your voice. I couldn&#8217;t resist for even a single day\u2014I signed up for GPT Plus that evening right after work.<\/p>\n<p>I tried generating a couple of images using the most basic prompts, and the results were absolutely mind-blowing.<\/p>\n<p>Swap products in one second \u2b07\ufe0f<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/001-d4d7c0474f07.png\" \/><\/p>\n<p>Transform male to female, change men&#8217;s clothing to women&#8217;s wear \u2b07\ufe0f<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/002-592430a74ab6.png\" \/><\/p>\n<p>Over the next two days, various stunning applications kept coming non-stop.<\/p>\n<p>So it&#8217;s not that I&#8217;m not updating, but there&#8217;s really nothing new to write about\u2014I can&#8217;t even keep up with learning it all.<\/p>\n<p>That was until I suddenly realized that everyone seems to be playing around with image-to-image generation.<\/p>\n<p>Image-to-image requires having an image or multiple images to start with, making the barrier a bit higher, and the generation speed is also slightly slower than text-to-image.<\/p>\n<p>As for me, a working-class blogger who isn&#8217;t financially free, I need to focus on productivity\u2014and I also need to make the tools accessible enough for others to use.<\/p>\n<p>So I brought back my trusty companion, the multi-dimensional table, and put together this template for generating Xiaohongshu (Little Red Book) cover images via text-to-image:<\/p>\n<p>https:\/\/ilovezhiwai.feishu.cn\/wiki\/XO7BwzedJi2PspkQKxCcxRBnnUw?table=ldxlY7sLPeUzd7ai<\/p>\n<p>It can generate covers in both image-text and text-only formats:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/003-d5c714ad5ef8.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/004-74b36b4ec049.png\" \/><\/p>\n<p>It&#8217;s very simple to use.<\/p>\n<p>First, let&#8217;s talk about the image-text cover.<\/p>\n<p>The first three columns are for input, where the &#8216;Note Topic&#8217; field is required. &#8216;Additional Info\/Requirements&#8217; and &#8216;Style Preset&#8217; can be left blank. However, filling in the latter two can better control the visual elements and style\u2014specific examples are pre-filled in the table for reference.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/005-af1a35e77676.png\" \/><\/p>\n<p>Once the input fields are filled, wait for the AI to generate the corresponding prompts.<\/p>\n<p>After generation, simply copy the content from the &#8216;Prompt Ctrl+C&#8217; field and send it to ChatGPT 4o to generate the image.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/006-33b9b2b99425.png\" \/><\/p>\n<p>Through flexible configuration, you can create covers suited for various scenarios and different styles.<\/p>\n<p>For example, if you&#8217;re a photography blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/007-5c3c68f1be96.png\" \/><\/p>\n<p>A travel blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/008-44db8e3156c6.png\" \/><\/p>\n<p>A finance blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/009-88fb026e937d.png\" \/><\/p>\n<p>A news commentary blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/010-99f18456bd0e.png\" \/><\/p>\n<p>A knowledge and education blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/011-43b4dd66733d.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/012-59e3655baef6.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/013-0e6c3fb4eb64.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/014-3f0b30771662.png\" \/><\/p>\n<p>A humor blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/015-54f1fc5222d9.png\" \/><\/p>\n<p>Even, a course-selling blogger:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/016-7e33e84ed5e4.png\" \/><\/p>\n<p>The usage for text-only covers is largely the same.<\/p>\n<p>The first four columns are for input. Like the image-text cover, &#8216;Note Topic&#8217; is required, while the other three are optional. However, &#8216;Style Type&#8217; and &#8216;Background Preset&#8217; have many built-in variations, so it&#8217;s recommended to select them.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/017-432dceca9caa.png\" \/><\/p>\n<p>The subsequent steps are the same as with image-text notes.<\/p>\n<p>Using this table, you can create covers with text as the main stylistic element:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/018-cf883f054ce5.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/004-74b36b4ec049.png\" \/><\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/020-81c70b1f7ea0.png\" \/><\/p>\n<p>Some tips:<\/p>\n<p>1. Theoretically, you can support generating images directly within the multi-dimensional table via API calls.<\/p>\n<p>I didn&#8217;t test this as I ran out of API credits, but I&#8217;ve included a template in the workflow that uses Silicon Flow to call image-to-image generation. Feel free to adjust and test it. For details, refer to item 9 in this article:<a href=\"https:\/\/blog.liu-qi.cn\/2025\/03\/07\/%E4%B8%80%E4%BA%9B%E9%A3%9E%E4%B9%A6%E5%A4%9A%E7%BB%B4%E8%A1%A8%E6%A0%BC%E7%9A%84ai%E4%BD%BF%E7%94%A8%E7%BB%8F%E9%AA%8C%E5%88%86%E4%BA%AB\/\">Sharing some AI usage experiences with Feishu multi-dimensional tables<\/a><\/p>\n<p>2. The style presets in the image-text table may become ineffective. For example, in the earliest version, I had included &#8216;Araki JOJO&#8217; and &#8216;Makoto Shinkai anime&#8217; style presets, but both are now invalid.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/021-66ef3b194b61.png\" \/><\/p>\n<p>After creating a table from the template, you can directly edit the fields, delete or add new options.<\/p>\n<p>3. Although GPT-4o has significantly improved in text generation, it still exhibits noticeable flaws with complex tasks. Both poster copy and prompt length are recommended to be kept concise.<\/p>\n<p>Based on my testing, the recommended model for prompt generation is<strong>Doubao-1.5-256K<\/strong>(The 32K prompt is slightly longer), offering a balance between scene description and prompt length. For stable output, the default AI model used in this table is<strong>Doubao-1.5-256K<\/strong>model, and you can experience the results for yourself.<\/p>\n<p>Additionally, the shortcut for selecting a custom service provider&#8217;s API field remains an old issue:<strong>When creating the template, the API Key isn&#8217;t reset, so it will continue to consume the API credits I&#8217;ve entered<\/strong>. Therefore, I kindly ask those with APIs and who know how to register a Volcano account to replace the API in the<strong>&#8216;Image Prompt&#8217; field with your own API after using the template.<\/strong>\u3002<\/p>\n<p>For those who don&#8217;t know how to register an account, it&#8217;s okay\u2014you can still use my API for now. However, I can&#8217;t guarantee it will work long-term. If costs become too high, I might deactivate this key, so please understand.<\/p>\n<p><strong>4.<\/strong>Besides the default Doubao model, I&#8217;ve also retained DeepSeek-V3 version prompt generation in the hidden fields of both sub-tables.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/022-d2bca9f212be.png\" \/><\/p>\n<p>Although, given GPT-4o&#8217;s current image generation capabilities, the direct success rate with this version is relatively lower. However, in terms of copy and visual richness, it&#8217;s noticeably better than Doubao 256K. With further manual processing in Photoshop, detailed dialogue adjustments, or generating and merging transparent layers, the results can still be quite usable.<\/p>\n<p>Here are some examples of the V3 version prompts outputting directly:<\/p>\n<p>Image-text \u2b07\ufe0f<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/023-38bdbdca0ca8.png\" \/><\/p>\n<p>Text-only \u2b07\ufe0f<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/024-5529775f791b.png\" \/><\/p>\n<p>Text-only covers are naturally slightly more stable than image-text ones. In the demo result examples in the table, each note topic has one Chinese result from Doubao and one from V3.<\/p>\n<p>As you can see, the copy and visual richness generated by V3 are somewhat better.<\/p>\n<p>Also, 4o&#8217;s English text output is usually more stable than its Chinese. If you&#8217;re using it in English scenarios, you might consider defaulting to V3.<\/p>\n<p>Comparison of generation results for two prompts translated into English:<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/025-58d70799a7c6.png\" \/><\/p>\n<p>5. If you need to adjust the default copy generation requirements and visual style, locate the formula field named &#8216;Prompt&#8217; and make adjustments.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/026-ce24b2e4d912.png\" \/><\/p>\n<p>For everyone&#8217;s convenience in modifying and adjusting, I&#8217;ve also included the prompts here.<\/p>\n<p>Mainly the two core prompts for image-text and text-only.<\/p>\n<p>These two prompts are mostly derived from AI analysis, with only minor manual tweaks.<\/p>\n<p>As a related industry practitioner, heh, I directly downloaded a batch of high-view-rate covers from the Xiaohongshu (Little Red Book) Juguang platform backend and had AI analyze them. Then I summarized most of the copy and visual element-related prompts.<\/p>\n<p><img decoding=\"async\" alt=\"\" loading=\"lazy\" src=\"https:\/\/blog.liu-qi.cn\/wp-content\/uploads\/2025\/03\/027-72a469a5da9a.png\" \/><\/p>\n<p>Engaging in a bit of &#8216;cutting corners&#8217; with fellow AI enthusiasts in the industry, then showing off AI skills in front of peers\u2014a philosophy of playful mischief XD.<\/p>\n<p>The prompts are as follows:<\/p>\n<p>Image-text type:<\/p>\n<pre><code>Please generate a detailed prompt for AI drawing based on the topic I provide: \"{Topic}\", to help create a high click-rate Xiaohongshu (Little Red Book) style cover image.\n<\/code><\/pre>\n<p>Text-only type:<\/p>\n<pre><code>Please create a high click-rate Xiaohongshu (Little Red Book) cover image with the topic: [Topic].\n<\/code><\/pre>\n<p>(The &#8216;Style Type&#8217; and &#8216;Background Preset&#8217; have multiple design variations, changing with the table&#8217;s options and concatenated via formulas. For the specific settings of different styles and backgrounds, check the formulas in the table.)<\/p>\n<p>Have fun~<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This guide introduces a multi-dimensional table template to generate Xiaohongshu cover images using GPT-4o&#8217;s multimodal capabilities with text prompts.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24],"tags":[7,13,18,12],"class_list":["post-136","post","type-post","status-publish","format-standard","hentry","category-articles","tag-ai-","tag-13","tag-18","tag-12"],"_links":{"self":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts\/136","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/comments?post=136"}],"version-history":[{"count":0,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/posts\/136\/revisions"}],"wp:attachment":[{"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/media?parent=136"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/categories?post=136"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/en.blog.liu-qi.cn\/index.php\/wp-json\/wp\/v2\/tags?post=136"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}