Âé¶¹ÒùÔº


Deepfake whales could be a key conservation tool

Deepfake whales could be key conservation tool
A North Atlantic right whale generated by AI. Credit: Duke MaRRS Lab

Scrolling through social media, you may have dallied on reels of Leonardo DiCaprio dancing or Tom Cruise crooning, only to realize they're spoofs created with artificial intelligence. Hyper-realistic videos and images like these—also called deepfakes—are notorious for celebrity pranking. But the technology has serious scientific applications, too. In the field of ecology, for example, AI doppelgängers of rare species could improve efforts to understand, monitor and protect them.

Specifically, wildlife deepfakes could help train AI models to detect wildlife in footage from satellites, planes and drones. Ecologists increasingly rely on such bird's-eye imagery to study species behavior and population trends.

"We are truly in the age of big data when it comes to remote sensing in ecology and conservation," says Dave Johnston, director of the Marine Robotics and Remote Sensing Lab at Duke's Nicholas School of the Environment. "Over the past two decades, our ability to collect high-resolution remote-sensing imagery has grown exponentially, largely due to advances in drone technology and increased satellite capabilities."

Augmenting data

Traditionally, researchers had to use their own eyes to scour satellite and aerial images for target species. Now, AI detection tools can expedite the process. The key is in the data used to train the computer models. The models need to "see" lots and lots of realistic examples of a species to know what to look for in field footage.

For some common wildlife, copious footage exists, so assembling training data is fairly easy. But footage is often limited for species that are rare, that blend into their surroundings or that live in inaccessible areas, such as war zones.

"One of the big challenges in ecology is the idea of data scarcity," says Henry Sun, a 2025 graduate of the Nicholas School who double-majored in biology and marine science and conservation, with a minor in computer science. "For a species where you only have several hundred individuals, you're just not going to have diverse enough images to be able to train a good AI detection model."

What's an ecologist to do? One promising option is to beef up scant training data with AI-generated, or synthetic, data—in essence, deepfakes. This approach, called data augmentation, could enable new ecological insights, according to a in Nature.

Sun, a former Nicholas School Rachel Carson Scholar and North Carolina Space Grant Undergraduate Research Scholar, recently investigated the topic of data augmentation for his senior thesis, which he plans to publish. Specifically, he explored whether AI could produce images realistic enough to supplement drone footage of the critically endangered North Atlantic , whose population has declined to fewer than 400 individuals. Theoretically, synthetic data could be used to help train other AI tools to detect North Atlantic right whales in real aerial footage.

Sun's research was inspired by a larger collaboration between the Nicholas School and several Canadian organizations—including the Canadian Space Agency, Fisheries and Oceans Canada, the University of New Brunswick and the environmental consulting group Hatfield Consultants—to build a space-based detection system for North Atlantic right whales, which are notoriously elusive, in large part because of their small population.

"There's a lot of ocean, and despite the fact that whales are coming back, there's still a very small number of them compared to the area that you have to search," says Johnston, who was Sun's thesis advisor. "And so that means we need very efficient tools to find them. But it also means that we don't often have really good archives of data to train those models to identify them."

To create deepfake whales, Sun and his team used diffusion models, which generate images in response to prompts in the form of descriptive text, an exemplary image or both. Although other researchers have for use in whale detection, the team says it's the first to use diffusion models for this purpose.

The researchers used several commercially available diffusion models that are pre-trained on reams of internet data. In other words, these base models, as they're known, are primed to produce a variety of images in response to prompts.

Sun and his team experimented with several methods of image generation, first using text prompts, then image prompts, and finally a method called fine-tuning, which included text and image prompts. Fine-tuning is a way to improve the performance of a base model for a specific task—in this case, producing high-resolution, hyper-realistic whale photos—by further training it on a smaller, more specific dataset.

"Sometimes the diffusion model produces anatomically deformed whale images, like whales that are conjoined or whales with multiple sets of fins, which shows that it hasn't exactly learned the most accurate representation yet," Sun explains. Fine-tuning can teach the model to avoid those mistakes.

Deepfake whales could be key conservation tool
Sometimes the models created anatomically deformed whales, like this two-tailed humpback. Credit: Duke MaRRS Lab

Testing credibility

All told, the team created hundreds of aerial images of North Atlantic right whales, and for comparison, hundreds of aerial images of humpback whales. Because far more real-life footage of humpbacks is available for training generative AI, the team hypothesized that their models would produce more realistic synthetic humpback imagery.

The last step was to test the veracity of their deepfake whales. Were they credible? To answer that question, the researchers fed their photos into a Google tool called Reverse Image Search, which analyzes an input image, searches the internet for similar pictures, and produces results. In this case, the goal was to see if Google could recognize the whales depicted in the synthetic data and return images of the same species.

In the fake photos produced by text or image prompts, Google mistook many North Atlantic right whales for humpbacks. By contrast, it correctly identified both species of whale in almost all of the images produced through fine-tuning.

The team also found that images of North Atlantic right whales created through fine-tuning were more accurate than those generated with text or image prompts alone.

The next phase of the research is to investigate whether synthetic whale imagery can indeed supplement for AI detection models. As a starting point, Sun enlisted Duke undergraduate Max Niu to begin basic testing.

"Max has been training deep-learning models using both real images and some of the fake images that I've made," Sun says. The idea is "to see if there's a proportion of fake images that will benefit the model."

Walking the line

This fall, Sun will continue his studies as a Ph.D. student at the Duke Marine Lab, working in the lab of Juliet Wong. Although he plans to turn his attention from whales to sea urchins, he is committed to helping demystify AI for researchers.

"Something that I'm extremely interested in is capacity-building for natural scientists in the realm of artificial intelligence, because I think increasingly, these are skills that everyone needs," Sun says. To that end, Sun hopes to plan AI-related outreach events, such as the hour-long session he hosted during Duke Oceans Week last March about using AI in ocean science.

As more ecologists turn to AI, however, ethical considerations will become more pressing, says Holly Houliston, a Ph.D. student with the British Antarctic Survey and the University of Cambridge, who helped supervise Sun's work as a visiting scholar at the Marine Lab. The data centers that power AI are energy- and water-intensive, so practices like generative AI data augmentation should be used conservatively in targeted ways, according to Houliston.

"You have to be really clear on the ecological question you're trying to answer. For example, if you are looking at calves—so, baby whales—then you might want to generate more images of these younger animals because you probably only have a few. But if you've got a balanced dataset, then maybe you don't need to generate more synthetic imagery," Houliston says. "The use of diffusion models and generative AI in general has environmental impacts. Studies like this can help us ecologists understand how to responsibly use them."

As Johnston notes, "this intersection between computer science and environmental sciences is only going to grow."

More information: Kasim Rafiq et al, Generative AI as a tool to accelerate the field of ecology, Nature Ecology & Evolution (2025).

Journal information: Nature , Nature Ecology & Evolution

Provided by Duke University

Citation: Deepfake whales could be a key conservation tool (2025, August 13) retrieved 13 August 2025 from /news/2025-08-deepfake-whales-key-tool.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Whale images used for artificial intelligence research

0 shares

Feedback to editors