
Black Forest Labs launched 4 new artificial intelligence (AI) instruments for its base text-to-image Flux.1 AI mannequin final week. These 4 AI instruments are primarily based on separate fashions designed to execute particular picture enhancing duties throughout the picture generator. The corporate claims that these instruments will supply granular management over the output photos and let customers protect the important thing components whereas experimenting with completely different kinds. The enhancing instruments can be found individually as developer fashions in open entry and professional fashions.
In a blog post, the AI agency detailed the 4 new picture enhancing instruments for Flux.1 AI mannequin. Builders can open-access the 4 instruments in separate fashions throughout the dev mannequin collection, whereas customers will get entry to the complete model through BFL API.
Immediately, we’re excited to launch FLUX.1 Instruments, a collection of fashions designed so as to add management and steerability to our base text-to-image mannequin FLUX.1, enabling the modification and re-creation of actual and generated photos. Be taught extra in our blogpost: https://t.co/J5Bc8fVGEc pic.twitter.com/7lEl74XYV4
— Black Forest Labs (@bfl_ml) November 21, 2024
The Flux.1 Fill is an inpainting and outpainting software that may edit the main points inside a picture or increase the boundaries of a picture utilizing textual content prompts and a binary masks. Based mostly on inner testing, the corporate claimed that the professional model of the software outperforms competing instruments equivalent to Ideogram 2.0. The developer model of the software is out there underneath the Flux Dev License and could be found on Hugging Face and GitHub. The professional model could be accessed through the BFL API.
Flux.1 Depth and Flux.1 Canny instruments let customers carry out structural conditioning of output throughout picture transformations. The Depth software preserves the generated picture’s construction via depth maps and retains it intact whereas customers make a text-guide edit. Equally, the Canny software preserves the construction by accessing the output’s canny edges. These are useful throughout retexturing-based edits.
The corporate claimed that the instruments outperform related instruments supplied by opponents equivalent to Midjourney and InstantX. The complete model gives most efficiency whereas the Low-rank adaptation (LoRA) model for builders permits for simpler deployment. It may be discovered here.
Lastly, the Flux.1 Redux permits customers to generate picture variations primarily based on an enter picture. Black Forest Labs claims the software can reproduce the picture with slight variation, which could be refined additional. It additionally permits picture restyling through prompts. Additionally, this software is supported by the Flux1.1 [pro] Extremely, the corporate’s flagship picture technology mannequin. The mannequin weights could be discovered here.
All the AI fashions will even be obtainable through third-party platforms equivalent to Fal.ai, Replicate, Collectively.ai, Freepik, and Krea.ai.
Catch the most recent from the Client Electronics Present on Devices 360, at our CES 2025 hub.