Top Qs
Timeline
Chat
Perspective

Flux (text-to-image model)

Image-generating machine learning model From Wikipedia, the free encyclopedia

Flux (text-to-image model)
Remove ads

Flux (also known as FLUX.1) is a text-to-image model developed by Black Forest Labs, based in Freiburg im Breisgau, Germany. Black Forest Labs was founded by former employees of Stability AI. As with other text-to-image models, Flux generates images from natural language descriptions, called prompts.

Quick Facts Original author(s), Developer(s) ...
Remove ads

History

Summarize
Perspective

Black Forest Labs was founded in 2024 by Robin Rombach, Andreas Blattmann, and Patrick Esser, former employees of Stability AI.[2][3] All three founders had previously researched the artificial intelligence image generation at Ludwig Maximilian University of Munich as research assistants under Björn Ommer.[4][5][6] They published their research results on image generation in 2022, which resulted in creation of Stable Diffusion.[6][7] Investors in Black Forest Labs included venture capital firm Andreessen Horowitz, Brendan Iribe, Michael Ovitz, Garry Tan, and Vladlen Koltun.[8] The company received an initial investment of US$31 million.[9][10]

In August 2024, Flux was integrated into the Grok chatbot developed by xAI and made available as part of premium feature on X (formerly Twitter).[11][12][13][14] Grok later switched to its own text-to-image model Aurora in December 2024.[15]

On 18 November 2024, Mistral AI announced that its Le Chat chatbot had integrated Flux Pro as its image generation model.[16][17]

On 21 November 2024, Black Forest Labs announced the release of Flux.1 Tools, a suite of editing tools designed to be used on top of existing Flux models. The tools consisting of Flux.1 Fill for inpainting and outpainting, Flux.1 Depth for control based on extracted depth map of input images and prompts, Flux.1 Canny for control based on extracted canny edges of input images and prompts, and Flux.1 Redux for mixing existing input images and prompts. Each tools are available in both Pro and Dev variants.[18][19]

In January 2025, Black Forest Labs announced a partnership with Nvidia for inclusion of Flux models as foundation models for Nvidia's Blackwell microarchitecture.[20] The company also announced the release of Flux Pro Finetuning API, designed for customisation and fine-tuning of Flux-generated images and a partnership with German media company Hubert Burda Media for usage of Flux Pro as part of content creation.[21]

On 29 May 2025, Black Forest Labs announced Flux.1 Kontext, a suite of models that enable in-context image generation and editing, allowing users to prompt with both text and images.[22][23] Alongside this, they launched the BFL Playground, an interface for testing Flux models.[22][23]

Remove ads

Models

Summarize
Perspective
Thumb
Thumb
Demonstration of Flux.1 Kontext (Pro) ability to modify an existing image

Flux is a series of text-to-image models. The models are based on rectified flow transformer blocks scaled to 12 billion parameters.[8][24] Flux.1 models were released under different licences with Schnell (meaning Fast or Quick in German language) released as open-source software under Apache License, Dev released as source-available software under a non-commercial licence, and Pro released as proprietary software and only available as API that can be licensed by third-party users.[25][26] Users retained the ownership of resulting output regardless of models used.[27][28]

The models can be used either online or locally by using generative AI user interfaces such as ComfyUI and Stable Diffusion WebUI Forge (a fork of Automatic1111 WebUI).[8][29]

An improved flagship model, Flux 1.1 Pro was released on 2 October 2024.[30][31] Two additional modes were added on 6 November, Ultra which can generate image at four times higher resolution and up to 4 megapixel without affecting generation speed and Raw which can generate hyper-realistic image in the style of candid photography.[32][33][34]

Flux.1 Kontext is a series with in-context image generation and editing capabilities. It is available in Pro and Max variants with Dev variant in private beta. Pro is the highest quality model and can be used to iteratively modify an existing image by using prompt while Max is optimised for speed of generation. Dev will be an open-weight model.[22]

Related to Flux is text-to-video model SOTA, under development as of June 2025.[8]

Remove ads

Reception

Summarize
Perspective

According to a test performed by Ars Technica, the outputs generated by Flux.1 Dev and Flux.1 Pro are comparable with DALL-E 3 in terms of prompt fidelity, with the photorealism closely matched Midjourney 6 and generated human hands with more consistency over previous models such as Stable Diffusion XL.[35]

Flux has been criticised for its very realistic generated images. According to media reports, depictions ranged from an image of Donald Trump posing with guns to disturbing scenes, which triggered discussions about ethical implications of technologies developed by Black Forest Labs.[4][13]

After the release of the model, social media platform X was flooded with Flux-generated images.[36][37] Black Forest Labs has not provided exact details of the data used to train the model.[32] Ars Technica suspected that Flux is based on a large, unauthorised collection of images scraped from the internet, a controversial practice with potential legal consequences.[35][38]

According to a test performed by Japanese technology news website Gigazine for Flux.1 Kontext, the model series has a good understanding in English language and can easily transfer style of image from photorealistic into anime-style according to prompts given by the user, however its capability to understand Japanese language is quite poor.[39]

Availability

In addition to the official BFL Playground on their website,[40] the Flux models are also widely available through various third-party platforms for creative and professional use. These include repositories on platforms like Hugging Face[41] and Replicate.[42]

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.

Remove ads