Are You Seeing Scars Too?: Examining Gray Whale Scars and Skin Conditions

By Serina Lane, GEMM Lab NSF REU Intern, Georgia Gwinnett College

Hello, everyone! My name is Serina and I’m a Research Experience for Undergraduates (REU) Intern at the Hatfield Marine Science Center (HMSC) this summer. I’ve had a love for the ocean for as long as I can remember. Honestly, it started off with just dolphins, but I soon started to realize that the ocean is full of fascinating creatures!

How I ended up here…well, I’ve never been to Oregon, I’m escaping the hot weather of Georgia, but I’m also getting to interact with like-minded marine biologists and experienced individuals at an amazing marine laboratory. At the age of 29, I’m also an older undergraduate student, and I will be graduating soon! I took a very long break from academics and coming back was hard, especially switching from business to biology. I have participated in surveys that asked how I felt about the statement “I am a scientist,” along with the degrees of agree and disagree. For most of my undergraduate career, I picked “slightly disagree”. I was getting great grades, but I did not feel like I was ever going to be able to accomplish the type of work scientific papers are written about. I really felt the need to gain more experience in the career path I intended to follow. All of these are the whirlwind ingredients that went into applying for the HMSC REU Internship at OSU! I’m being mentored by the lovely Natalie Chazal and Leigh Torres, and I am grateful for the opportunity and very excited to experience everything Hatfield has to offer. A little over a week of being here, I already feel my answer sliding from “neutral” to even “slightly agree”. There is still so much to learn!

The project I’m helping with is analyzing the scarring and skin conditions of Eastern North Pacific gray whales alongside the GRANITE team. My job will be analyzing over 100,000 pictures from the past eight years to detect various scars and potential skin conditions (yes, the comma is in the correct spot and no, there are no extra 0’s). Scars can come from a variety of sources such as boat propellers, fishing gear, and killer whales! A study conducted by Corsi et al. consisted of documenting killer whale rake marks (bites, essentially) on different types of whales in the eastern North Pacific. Their results showed that gray whales had the highest percentage of observed rake marks in sighted individuals, and provided insight into why body sections of observed marks are important. Most baleen whales had rake marks predominantly on their flukes, because they are often used for defense and if fleeing, are the closest area to bite. Fascinatingly, Corsi et al. consider that the higher occurrences of gray whale rake marks are due to killer whales adopting species-specific hunting approaches. Gray whales have predictable migratory routes, and we already know how intelligent killer whales can be. If I knew a truck had a specific delivery route and I could wait to intercept a fresh delivery of Krispy Kreme donuts, why wouldn’t I? 

Donuts aside, I’ll also be categorizing where the scars/skin conditions are located – for example, certain regions on the tail (like above) or on their left or right back (often due to boat collisions). Then I’ll define what I believe to be the source of scarring and rate my confidence in that decision based on the photo. Now, not all of the photos are clear enough for me to make informed decisions, so realistically I could end up with only a few hundred usable photos. At the end of the summer, we’ll gather the results and compare the different rates of scarring sources and the body parts where they occurred, and analyze any patterns in skin conditions, such as whether a skin condition has worsened or improved on an individual we have sighted multiple times over the years.

 Figure 1. A little look into a table I made to give examples of what scarring from different sources look like.

Surprisingly, cetaceans can heal deep wounds on their own without medical intervention. Scientists have discovered that compounds in their blubber layer, such as organohalogens and isovaleric acid, may naturally fight off infections and help wounds heal faster. Unlike humans and other terrestrial animals that form scabs when injured, cetaceans develop a different protective layer over their wounds. This layer consists of degenerative cells mixed with tiny bubbles and covers the injured area. This unique adaptation might help protect the wound from seawater and other environmental factors. While there have been studies on how surface wounds heal in captive dolphins and whales, there’s still much to learn about how these animals heal large, deep wounds. Understanding how wounds heal can help us to more accurately assess the frequency at which whales are wounded, whether it be from fishing gear or boats, to cookie cutter sharks or killer whales.

It seems like a lot, and it is, but our ultimate goal is to assess the effects that scarring and skin conditions can have in the ecology of marine megafauna. Assessing the individual gray whales in the photos can provide a bigger picture of the health of a whole population. We can also look for any patterns of skin conditions between mother and calf, individuals that are around each other often, adults and juveniles, or males and females. Scars may also play a role in a population’s health. If a gray whale had an open wound previously, did it develop into a skin condition? Did a skin condition worsen? Did it leave them more vulnerable to predators? These are the questions we would like to elaborate on with this research. A great read on this topic was conducted by Dawn R. Barlow, Acacia L. Pepper and Leigh G. Torres, which will be in the references below (Barlow et al., 2019). A better understanding of potential patterns is a better assessment of our current marine management practices. Is it enough, or do we need to change and do more?

Okay, lastly, let’s talk about artificial intelligence (AI). Would using AI methods for this project make our lives easier? Yes. If we could train AI to accurately identify specific scars and skin conditions, our 100,000 photos could be done within minutes. For my job security, woo no AI! But on a serious note, this approach could free up time that could be spent on other efforts, or speed up the process of assessing marine management. However, we gain so much by reviewing the photos ourselves which is still important to do when training AI on what specifics to search for. Over the summer, I’m going to get to know different whales and see how they may change over 8 years, just by their pictures. My excitement grew as soon as I looked at my first 3 gray whales and learned their names. It’s forever important to remember that we can always learn from sharing connections with the organisms we study and interact with. We share the same planet and we have to work together to preserve it. I thank you all for taking a trip through our summer research with me and I hope to meet some of you around Hatfield!

References

Barlow, D. R., Pepper, A. L., & Torres, L. G. (2019a). Skin deep: An assessment of New Zealand blue whale skin condition. Frontiers in Marine Science, 6. https://doi.org/10.3389/fmars.2019.00757 

Bradford, A. L., Weller, D. W., Ivashchenko, Y. V., Burdin, A. M., & Brownell, Jr, R. L. (2009). Anthropogenic scarring of Western Gray Whales (Eschrichtius robustus). Marine Mammal Science, 25(1), 161–175. https://doi.org/10.1111/j.1748-7692.2008.00253.x 

Corsi, E., Calambokidis, J., Flynn, K. R., & Steiger, G. H. (2021). Killer whale predatory scarring on Mysticetes: A comparison of rake marks among blue, humpback, and gray whales in the eastern North Pacific. Marine Mammal Science, 38(1), 223–234. https://doi.org/10.1111/mms.12863 

NOAA. (2020, April 4). Fisheries of the United States. https://www.fisheries.noaa.gov/national/sustainable-fisheries/fisheries-united-states

Hamilton, P. K., & Marx, M. K. (2005). Skin lesions on North Atlantic right whales: Categories, prevalence and change in occurrence in the 1990s. Diseases of Aquatic Organisms, 68, 71–82. https://doi.org/10.3354/dao068071 

Pettis, H. M., Rolland, R. M., Hamilton, P. K., Brault, S., Knowlton, A. R., & Kraus, S. D. (2004). Visual health assessment of north atlantic right whales (Eubalaena glacialis) using photographs. Canadian Journal of Zoology, 82(1), 8–19. https://doi.org/10.1139/z03-207 

Silber, G. K., Weller, D. W., Reeves, R. R., Adams, J. D., & Moore, T. J. (2021). Co-occurrence of gray whales and vessel traffic in the North Pacific Ocean. Endangered Species Research, 44, 177–201. https://doi.org/10.3354/esr01093 Sun, L., Engle, C., Kumar, G., & van Senten, J. (2022). Retail market trends for Seafood in the United States. Journal of the World Aquaculture Society, 54(3), 603–624. https://doi.org/10.1111/jwas.12919

Blubber and Barnacles: An Introduction to Cetacean Skin Disease

By Natalie Chazal, PhD student, OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

Ever noticed how our skin gets pruny and overly soft after just ten minutes in the water? That’s because human skin is adapted for life on dry land, where retaining moisture is a primary concern. In contrast, cetaceans have evolved remarkable adaptations to thrive in the cold, salty ocean water for their entire lives. Understanding cetacean skin is crucial for conservation efforts, as it allows us to monitor and assess the overall health of these marine populations. By analyzing skin conditions, we can identify scarring patterns and lesions that may indicate interactions with human activities, such as entanglements or boat strikes, which can inform more effective risk assessment and mitigation strategies. Additionally, tracking the progression of skin diseases provides vital information on the prevalence and impact of pathogens, in order to guide more targeted management strategies to improve whale health and population resilience in their changing environments. To fully appreciate why monitoring skin diseases in cetaceans matters, let’s first explore the anatomy and physiology of cetacean skin and understand how scarring and diseases occur.

Whale skin has similar layers to our own, but modified over millions of years of evolution. Thicker than terrestrial mammals, the epidermis (the outermost layer) in marine mammals is designed to help maintain hydration in a hyperosmotic (very salty) environment where water is trying to flow into the cells of the whale. This top layer sloughs off at the surface as new cells are continuously renewed. The hypodermis, or blubber layer, is composed of primarily vascularized fat cells which insulate, store energy, and regulate buoyancy (Figure 1). 

Figure 1. Major layers of whale skin with the pop up showing a detailed figure of the epidermal/hypodermal junction (Mouton et al. 2011). 

Some other interesting skin adaptations that allow whales to maximize their efficiency underwater include near hairlessness, no sweat glands, and high levels of melanin. First, cetacean hairlessness helps them reduce drag in the water, but they don’t quite lack all hair. Most species of whales have hair around their mouths when they’re developing in the womb and then lose their hair either before birth or shortly after. Some species, like the humpback, have tubercles that are modified hair follicles to help them sense their surroundings, similar to whiskers on a dog. Second, because sweating is not effective for thermoregulation in the aquatic environment, whales have lost the sweat gland structure in their skin, making it slightly less permeable than terrestrial mammals. Their lack of glands also means that whales don’t secrete their own oils to maintain the moisture of the skin. So, if they’re exposed to dry air, their skin will dry out faster than the skin of terrestrial mammals. Lastly, melanin pigments vary from species to species. You can easily see this when we compare lateral surface photos of different species (Figure 2).

Figure 2. Comparison of surfacing photos between blue whales (upper left), Cuvier’s beaked whale (upper right), gray whale (lower left), and beluga whale (lower right) coloration. Blue and gray whale photos from GEMM Lab, beaked whale photo from Cascadia Research Collective (https://cascadiaresearch.org/files/Discriminating-between-Cuviers-and-Blainvilles-beaked-whales.pdf), and beluga whale photo from NOAA (https://www.fisheries.noaa.gov/event/2022-belugas-count)

This difference in coloration can be used by animals for camouflage either to avoid predators or to help ambush prey, and helps us to identify the species while they are at the surface. Coloration can also change as an animal ages and can help signal to us or other conspecifics the age or reproductive status of the individual (Caro et al. 2011). The melanin that creates these different colorations can protect whales against the harmful effects of UV radiation by absorbing and dissipating UV radiation, which decreases how far it penetrates into the skin, reducing cell damage (Morales-Guerrero et al. 2017). 

Thus, whale skin is very well adapted to the aquatic environment, from thick blubber layers to no sweat glands. However, despite these adaptations, cetaceans remain vulnerable to a range of pathogens. The major skin diseases documented in whales can fall into 4 categories: viral, bacterial, fungal, and parasitic. Viral infections in cetaceans involve the invasion of host cells, where viruses replicate and cause cell death or dysfunction, leading directly to skin lesions or nodules. Viruses can also manipulate the host immune response, suppressing immunity and exacerbating inflammation, which further contributes to skin damage. In contrast, fungal infections typically involve fungal growth and colonization on the skin surface or within tissues, with some fungi producing toxins that directly damage cells or provoke inflammatory responses (Espregueira et al. 2023). Bacterial infections in cetaceans often result from bacterial invasion and multiplication within skin tissues, accompanied by toxin production that damages cells and triggers a robust inflammatory response (Bressem et al. 2009). Parasitic infections, such as barnacle and whale lice infestations, can cause irritation, abrasions, and compromise the skin’s protective function, leading to localized inflammation and potential secondary infections. 

Understanding the specific causes of skin conditions in cetaceans is crucial because different pathogens spread through populations in distinct ways, impacting both individuals and population level health. Viral infections, for instance, can spread rapidly within populations through direct contact or respiratory droplets, potentially leading to widespread outbreaks and systemic effects. Fungal infections may persist in environmental reservoirs (spores of fungus can exist in seawater, sediment, organic marine debris, and the air) and can affect multiple individuals over time, particularly in conditions favoring fungal growth. Bacterial infections often spread through direct contact or contaminated environments, posing risks of localized outbreaks and secondary complications. Parasitic infestations, such as barnacles and whale lice, can transmit between individuals through close contact or shared habitat spaces (Romero et al 2012). By accurately identifying the causative agents of skin diseases, we can assess their transmission dynamics, anticipate population-level impacts, and implement targeted management strategies to mitigate disease spread and preserve whale health.

There are complex factors that contribute to skin disease prevalence in cetaceans. Environmental degradation, chemical pollution, climate change, and other anthropogenic stressors are known to lower immune systems, and degrade prey quality and quantity (Bressem et al. 2009). To understand the interactions between disease and the environment, we have to begin by establishing baseline health metrics. This summer, we will characterize an emerging skin disease in gray whales (see Zorro’s progression in Figure 3) using the photographs taken from the last 9 years of GRANITE fieldwork. Gray whales are particularly vulnerable to environmental threats because of their reliance on nearshore habitats. Unlike some other cetacean species that venture into deeper waters, gray whales are primarily coastal dwellers, feeding on benthic and epi-benthic organisms found in shallow, nutrient-rich waters. This dependence on nearshore environments exposes them to numerous risks. Pollution from runoff, oil spills, and plastic debris accumulates in these coastal waters, disrupting their immune systems leaving them more susceptible to disease. Climate change can induce shifts in the environment that alter the availability and quality of these habitats, potentially forcing them into proximity of other animals or places that harbor more disease. Habitat degradation due to coastal development and human activities like overfishing and increased vessel traffic further restricts their access to critical feeding areas (Bressem et al. 2009).

Figure 3. Comparisons of Zorro (a PCFG gray whale) between a year with no skin condition, 2020 (left panels) and this year where he came back covered in an unknown skin condition, 2024 (right panels). The upper panels capture his left side and the lower panels capture his right side.

These cumulative impacts increase the susceptibility of gray whales to diseases and stressors, highlighting the urgent need for baseline health assessments and identifying early signs of environmental stress (Stimmelmayr 2020). By documenting and analyzing skin conditions of gray whales through photographs, we can track changes over time and correlate them with environmental factors like pollution levels or habitat alterations. This non-invasive approach not only provides valuable insights into the prevalence and severity of skin diseases but also helps to understand broader ecological health trends in gray whale populations. 

P.S. Check out IndividuWhale to explore some great examples of how the skin condition of some of the local Oregon PCFG gray whales compare to each other and how we use their specific markings to help identify them in the field. 

References

Barlow, D.R., Pepper, A.L., Torres, L.G., 2019. Skin Deep: An Assessment of New Zealand Blue Whale Skin Condition. Frontiers in Marine Science 6.

Bressem, M.-F.V., Raga, J.A., Guardo, G.D., Jepson, P.D., Duignan, P.J., Siebert, U., Barrett, T., Santos, M.C. de O., Moreno, I.B., Siciliano, S., Aguilar, A., Waerebeek, K.V., 2009. Emerging infectious diseases in cetaceans worldwide and the possible role of environmental stressors. Diseases of Aquatic Organisms 86, 143–157. https://doi.org/10.3354/dao02101

Callewaert, C., Ravard Helffer, K., Lebaron, P., 2020. Skin Microbiome and its Interplay with the Environment. Am J Clin Dermatol 21, 4–11. https://doi.org/10.1007/s40257-020-00551-x

Caro, T., Beeman, K., Stankowich, T., Whitehead, H., 2011. The functional significance of colouration in cetaceans. Evol Ecol 25, 1231–1245. https://doi.org/10.1007/s10682-011-9479-5

Espregueira Themudo, G., Alves, L.Q., Machado, A.M., Lopes-Marques, M., da Fonseca, R.R., Fonseca, M., Ruivo, R., Castro, L.F.C., 2020. Losing Genes: The Evolutionary Remodeling of Cetacea Skin. Front. Mar. Sci. 7. https://doi.org/10.3389/fmars.2020.592375

Menon, G.K., Elias, P.M., Wakefield, J.S., Crumrine, D., 2022. CETACEAN EPIDERMAL SPECIALIZATION: A REVIEW. Anat Histol Embryol 51, 563–575. https://doi.org/10.1111/ahe.12829

Morales-Guerrero, B., Barragán-Vargas, C., Silva-Rosales, G.R., Ortega-Ortiz, C.D., Gendron, D., Martinez-Levasseur, L.M., Acevedo-Whitehouse, K., 2017. Melanin granules melanophages and a fully-melanized epidermis are common traits of odontocete and mysticete cetaceans. Veterinary Dermatology 28, 213-e50. https://doi.org/10.1111/vde.12392

Mouton, M., Botha, A., Mouton, M., Botha, A., 2012. Cutaneous Lesions in Cetaceans: An Indicator of Ecosystem Status?, in: New Approaches to the Study of Marine Mammals. IntechOpen. https://doi.org/10.5772/54432

Pitman, R.L., Durban, J.W., Joyce, T., Fearnbach, H., Panigada, S., Lauriano, G., 2020. Skin in the game: Epidermal molt as a driver of long-distance migration in whales. Marine Mammal Science 36, 565–594. https://doi.org/10.1111/mms.12661

Romero, A., Keith, E.O., 2012. New Approaches to the Study of Marine Mammals. BoD – Books on Demand.

Stimmelmayr, R., Gulland, F.M.D., 2020. Gray Whale (Eschrichtius robustus) Health and Disease: Review and Future Directions. Frontiers in Marine Science 7.

Su, C.-Y., Hughes, M.W., Liu, T.-Y., Chuong, C.-M., Wang, H.-V., Yang, W.-C., 2022. Defining Wound Healing Progression in Cetacean Skin: Characteristics of Full-Thickness Wound Healing in Fraser’s Dolphins (Lagenodelphis hosei). Animals (Basel) 12, 537. https://doi.org/10.3390/ani12050537

Van Bressem, M.-F., Van Waerebeek, K., Duignan, P.J., 2022. Tattoo Skin Disease in Cetacea: A Review, with New Cases for the Northeast Pacific. Animals 12, 3581. https://doi.org/10.3390/ani12243581

Significant others? Thinking beyond p-values in science

By Natalie Chazal, PhD student, OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

Scientific inquiry relies on quantifying how certain we are of the differences we see in observations. This means that we must look at phenomena based on probabilities that we calculate from observed data, or data that we collect from sampling efforts. Historically, p-values have served as a relatively ubiquitous tool for assessing the strength of evidence in support of a hypothesis. However, as our understanding of statistical methods evolves, so does the scrutiny surrounding the appropriateness and interpretation of p-values. In the realm of research, the debate surrounding the use of p-values for determining statistical significance has sparked some controversy and reflection within the academic community. 

What is a p-value?

To understand the debate itself, we need to understand what a p-value is. The p-value represents the probability of obtaining a result as extreme as, or more extreme than, the observed data, under the assumption that there is no true difference or relationship between groups or variables. Traditionally, a p-value below a predetermined threshold (often 0.05) is considered statistically significant, suggesting that the observed data are unlikely (i.e., a 5% probability) to have occurred by chance alone. Many statistical tests provide p-values, which gives us a unified framework for interpretation across a range of analyses.

To illustrate this, imagine a study aimed at investigating the effects of underwater noise pollution on the foraging behavior of gray whales. Researchers collect data on the diving behavior of gray whales in both noisy and quiet regions of the ocean.

Drawings of gray whales with tags (depicted by orange shapes) in quiet areas (left) and noisy areas (right). 

In this example, the researchers hypothesize that gray whales stop foraging and ultimately change their diving behavior in response to increased marine noise pollution. The data collected from this hypothetical scenario could come from tags equipped with sensors that record diving depth, duration, and location, allowing us to calculate the exact length of time spent foraging. Data would be collected from both noisy areas (maybe near shipping lanes or industrial sites) and quiet areas (more remote regions with minimal human activity). 

To assess the significance of the differences between the two noise regimes, researchers may use statistical tests like t-tests to compare two groups. In our example, researchers use a t-test to compare the average foraging time between whales in noisy and quiet regimes. The next step would be to define hypotheses about the differences we expect to see. The null hypothesis (HN) would be that there is no difference in the average foraging time (X) between noisy and quiet areas: 

Scenario where the noisy area does not elicit a behavioral response that can be detected by the data collected by the tags (orange shapes on whales back). The lower graph shows the distribution of the data (foraging time) for the noisy and the quiet areas. The means of this data (X) are not different. 

And the alternative hypothesis (HA) would be that there is a difference between the noisy and quiet areas: 

Scenario where the noisy area elicits a behavioral response (swimming more towards the surface instead of foraging) that can be detected by the data collected by the tags (orange shapes on whales back). The lower graph shows the distribution of the data (foraging time) for the noisy and the quiet areas. The means of this data (X) are different with the noisy mean foraging time (pink) being lower than the quiet mean foraging time (blue).

For now, we will skip over the nitty gritty of a t-test and just say that the researchers get a “t-score” that says whether or not there is a difference in the means (X) of the quiet and noisy areas. A larger t-score means that there is a difference in the means whereas a smaller t-score would indicate that the means are more similar. This t-score comes along with a p-value. Let’s say we get a t-score (green dot) that is associated with a p-value of 0.03 shown as the yellow area under the curve: 

The t-score is a test statistic that tells us how different the means of our observed data groups are from each other (green dot). The area under the t-distribution that is above the t-score is the p-value (yellow shaded area).

A p-value of 0.03 means that there is a 3% probability of obtaining these observed differences in foraging time between noisy and quiet areas purely by chance, which assumes that the null hypothesis is true (that there is no difference). We usually compare this p-value to a threshold value to say whether this finding is significant. We set this threshold before looking at the results of the test. If the threshold is above our value, like 0.05, then we can “reject the null hypothesis” and conclude that there is a significant difference in foraging time between noisy and quiet areas (green check mark scenario). On the flip-side, if the threshold that we set before our results is too low (0.01), then we will “fail to reject the null hypothesis” and conclude that there was no significant difference in foraging time between noisy and quiet areas (red check mark scenario). The reason that we don’t ever “accept the null” is because we are testing an alternative hypothesis with observations and if those observations are consistent with the null rather than the alternative, this is not evidence for the null because it could be consistent with a different alternative hypothesis that we are not yet testing for.

When our pre-set threshold to determine significance is above or greater than our p-value that was calculated we have enough evidence to ‘reject the null hypothesis’ (left figure) whereas if our p-value is lower or smaller than our calculated p-value, then we ‘fail to reject the null hypothesis’ (right figure).

In this example, the use of p-values helps the researchers quantify the strength of evidence for their hypothesis and determine whether the observed differences in gray whale behavior are likely to be meaningful or merely due to chance. 

The Debate

Despite its widespread use, the reliance on p-values has been met with criticism. Firstly, because p-values are so ubiquitous, it can be easy to calculate them with or without enough critical thinking or interpretation. This critical thinking should include an understanding of what is biologically relevant and avoid the trap of using binary language like significant or non-significant results instead of looking directly at the uncertainty of your results. One of the other most common misconceptions about p-values is that they can measure the direct probability of the null hypothesis being true. As amazing as that would be, in reality we can only use p-values to understand the probability of our observed data. Additionally, it’s common to conflate the significance or magnitude of the p-value with effect size (which is the strength of the relationship between the variables). You can have a small p-value for an effect that isn’t very large or meaningful, especially if you have a large sample size. Sample size is an important metric to report. Larger number of samples generally means more precise estimates, higher statistical power, increased generalizability, and higher possibility for replication.

Furthermore, in studies that require multiple comparisons (i.e. multiple statistical analyses are done in a single study), there is an increased likelihood of observing false positives because each test introduces a chance of obtaining a significant result by random variability alone. In p-value language, a “false positive” is when you say something is significant (below your p-value threshold) when it actually is not, and a “false negative” is when you say something is not significant (above the p-value threshold) when it actually is. So, in terms of multiple comparisons, if there are no adjustments made for the increased risk of false positives, this can potentially lead to inaccurate conclusions of significance.

In our example using foraging time in gray whales, we didn’t consider the context of our findings. To make this a more reliable study, we have to consider factors like the number of whales tagged (sample size!), the magnitude of noise near the tagged whales, other variables in the environment (e.g. prey availability) that could affect our results, and the ecological significance in the difference in foraging time that was found. To make robust conclusions, we need to carefully build hypotheses and study designs that will answer the questions we seek. We must then carefully choose the statistical tests that we use and explore how our data align with the assumptions that these tests make. It’s essential to contextualize our results within the bounds of our study design and broader ecological system. Finally, performing sensitivity analyses (e.g. running the same tests multiple times on slightly different datasets) ensures that our results are stable over a variety of different model parameters and assumptions. 

In the real world, there have been many studies done on the effects of noise pollution on baleen whale behavior that incorporate multiple sources of variance and bias to get robust results that show behavioral responses and physiological consequences to anthropogenic sound stressors (Melcón et al. 2012, Blair et al. 2016, Gailey et al. 2022, Lemos et al. 2022).

Moving Beyond P-values

There has been growing interest in reassessing the role of p-values in scientific inference and publishing. Scientists appreciate p-values because they provide one clear numeric threshold to determine significance of their results. However, the reality is more complicated than this binary approach. We have to explore the uncertainty around these estimates and test statistics (e.g. t-score) and what they represent ecologically. One avenue to explore might be focusing more on effect sizes and confidence intervals as more informative measures of the magnitude and precision of observed effects. There has also been a shift towards using Bayesian methods, which allow for the incorporation of prior knowledge and a more nuanced quantification of uncertainty.

Bayesian methods in particular are a leading alternative to p-values because instead of looking at how likely our observations are given a null hypothesis, we get a direct probability of the hypothesis given our data. For example, we can use Bayes factor for our noisy vs quiet gray whale behavioral t-test (Johnson et al. 2023). Bayes factor measures the likelihood of the data being observed for each hypothesis separately (instead of assuming the null hypothesis is true) so if we calculate a Bayes factor of 3 for the alternative hypothesis (HA), we could directly say that it is 3 times more likely for there to be decreased foraging time in a noisy area than for there to be no difference in the noisy vs quiet group. But that is just one example of Bayesian methods at work. The GEMM lab uses Bayesian methods in many projects from Lisa’s spatial capture-recapture models (link to blog) and Dawn’s blue whale abundance estimates (Barlow et al. 2018) to quantifying uncertainty associated with drone photogrammetry data collection methods in KC’s body size models (link to blog). 

Ultimately, the debate surrounding p-values highlights the necessity of nuanced and transparent approaches to statistical inference in scientific research. Rather than relying solely on arbitrary thresholds, researchers can consider the context, relevance, and robustness of their findings. From justifying our significance thresholds to directly describing parameters based on probability, we have increasingly powerful tools to improve the methodological rigor of our studies. 

References

Agathokleous, E., 2022. Environmental pollution impacts: Are p values over-valued? Science of The Total Environment 850, 157807. https://doi.org/10.1016/j.scitotenv.2022.157807

Barlow, D.R., Torres, L.G., Hodge, K.B., Steel, D., Baker, C.S., Chandler, T.E., Bott, N., Constantine, R., Double, M.C., Gill, P., Glasgow, D., Hamner, R.M., Lilley, C., Ogle, M., Olson, P.A., Peters, C., Stockin, K.A., Tessaglia-Hymes, C.T., Klinck, H., 2018. Documentation of a New Zealand blue whale population based on multiple lines of evidence. Endangered Species Research 36, 27–40. https://doi.org/10.3354/esr00891

Blair, H.B., Merchant, N.D., Friedlaender, A.S., Wiley, D.N., Parks, S.E., 2016. Evidence for ship noise impacts on humpback whale foraging behaviour. Biol Lett 12, 20160005. https://doi.org/10.1098/rsbl.2016.0005

Brophy, C., 2015. Should ecologists be banned from using p-values? Journal of Ecology Blog. URL https://jecologyblog.com/2015/03/06/should-ecologists-be-banned-from-using-p-values/ (accessed 4.19.24).

Castilho, L.B., Prado, P.I., 2021. Towards a pragmatic use of statistics in ecology. PeerJ 9, e12090. https://doi.org/10.7717/peerj.12090

Gailey, G., Sychenko, O., Zykov, M., Rutenko, A., Blanchard, A., Melton, R.H., 2022. Western gray whale behavioral response to seismic surveys during their foraging season. Environ Monit Assess 194, 740. https://doi.org/10.1007/s10661-022-10023-w

Halsey, L.G., 2019. The reign of the p-value is over: what alternative analyses could we employ to fill the power vacuum? Biology Letters 15, 20190174. https://doi.org/10.1098/rsbl.2019.0174

Johnson, V.E., Pramanik, S., Shudde, R., 2023. Bayes factor functions for reporting outcomes of hypothesis tests. Proceedings of the National Academy of Sciences 120, e2217331120. https://doi.org/10.1073/pnas.2217331120

Lemos, L.S., Haxel, J.H., Olsen, A., Burnett, J.D., Smith, A., Chandler, T.E., Nieukirk, S.L., Larson, S.E., Hunt, K.E., Torres, L.G., 2022. Effects of vessel traffic and ocean noise on gray whale stress hormones. Sci Rep 12, 18580. https://doi.org/10.1038/s41598-022-14510-5

LU, Y., BELITSKAYA-LEVY, I., 2015. The debate about p-values. Shanghai Arch Psychiatry 27, 381–385. https://doi.org/10.11919/j.issn.1002-0829.216027

Melcón, M.L., Cummins, A.J., Kerosky, S.M., Roche, L.K., Wiggins, S.M., Hildebrand, J.A., 2012. Blue Whales Respond to Anthropogenic Noise. PLOS ONE 7, e32681. https://doi.org/10.1371/journal.pone.0032681

Murtaugh, P.A., 2014. In defense of P values. Ecology 95, 611–617. https://doi.org/10.1890/13-0590.1

Vidgen, B., Yasseri, T., 2016. P-Values: Misunderstood and Misused. Front. Phys. 4. https://doi.org/10.3389/fphy.2016.00006

From Bytes to Behaviors: How AI is Used to Study Whales

By Natalie Chazal, PhD student, OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

In today’s media, artificial intelligence, or AI, has captured headlines that can stir up strong emotions and opinions. From promises of seemingly impossible breakthroughs to warnings of job displacement and ethical dilemmas, there is a lot of discourse surrounding AI. 

But what actually is artificial intelligence? The term artificial intelligence (or AI) was defined as “the science and engineering of making intelligent machines” and can generally describe a suite of methods used to simulate human information processing. 

AI actually began in the 1950s with puzzle solving robots and networks that identified shapes. But because the computational power required to run these complex networks was too high and funding cuts, there was an “AI winter” for the following decades. In the 1990’s there was a boom in advancement following renewed interest in AI, advancements in machine learning algorithms, and improved computational power. The 2010’s saw a resurgence of deep learning (a subfield of AI) designed because of the availability of large datasets and optimization algorithm improvements. Currently, AI is being used in extremely diverse ways because of its ability to handle large quantities of unstructured data.

Figure 1. An intuitive visualization of the nested relationship between AI, machine learning, and deep learning as subdomains (Rubbens et al. 2023)

To place AI in a better context, we should clarify some of the buzz words I’ve mentioned: artificial intelligence (AI), machine learning, and deep learning. There are a few schools of thought, but one that is generally accepted is that AI is a broad category of methods and techniques of systems that function to mimic human intelligence. Machine learning falls under this AI category but rather than using explicitly programmed rules to make decisions, we “train” these systems so that they are essentially learning from the data that we provide. Lastly, deep learning falls under machine learning because it uses the principles of “learning” from the data to build neural networks.

While AI is generally rooted in computer science, statistics provides the foundation for AI techniques. In particular, statistical learning is a combined field that adopts machine learning methods for more statistics based settings. Trevor Hastie, a leader in statistical learning, defines the field as “a set of tools for modeling and understanding complex datasets” (Hastie et al. 2009) and is used to explore patterns in data but within a statistical framework. 

Continuously improving methods like statistical learning and AI provide us with very powerful tools to collect data, automate processing, handle large datasets, and understand complex processes. 

How do marine mammal ecologists employ AI?

Even on small scales, marine mammal research often involves vast amounts of data collected from tons of different sources, including drone and satellite imagery, acoustic recordings, boat surveys, buoys, and many more. New deep learning tools, such as neural networks, are able to perform tasks with remarkable precision and speed that we traditionally needed to painstakingly do manually. For example, researchers spend hours poring over thousands of drone images and videos to understand the behavior and health of whales. In the GEMM Lab, postdoc KC Bierlich is leading the development of AI models to automatically measure important whale metrics from the images. These advancements streamline the process of understanding whale ecology and makes it easier to identify stressors that may be affecting these animals.

For photographic analyses, we can leverage Convolutional Neural Networks for tasks like feature extraction, where we can automatically get morphological measurements like body length and body area indices from drone imagery to understand the health of whales. This can provide valuable insight into the stressors placed on these animals. 

We can also identify whale species from boat and aerial imagery (Patton et al. 2023). Projects like Flukebook and Happywhale have even been able to identify individual humpback whales with techniques like this one. 

Figure 2. Flukebook neural networks can use the edges of flukes to identify individuals by mapping marks to a library of known individuals (Flukebook)

AI also excels at prediction especially with non-linear responses. Ecology is filled with thresholds, stepwise changes, and chaos that may not be captured by linear models. But being able to predict these responses is particularly important when we want to look at how whale populations respond to different facets of their environment. Ensemble machine learning algorithms like Random Forests or Gradient Boosting Machines are very common to model species-habitat relationships and can predict how whale distributions will change in response to changes in things like sea surface temperature or ocean currents (Viquerat et al. 2022). 

Even spatial data, which can be tricky to work with analytically, can be used in a machine learning framework. Data from satellite and acoustic tags can be analyzed from hidden Markov models and Gaussian mixture models. The results of these could potentially identify diving behaviors, habitat preferences, identify migration corridors, and aid in marine spatial planning (Quick et al. 2017; Lennox et al. 2019). 

While all of these projects and methods are very exciting, AI is not a panacea. We have to take into account the amount of data that AI models rely on. Some of these methods require very high resolutions of data and without adequate quantity to train the models, results can be biased or produce inaccurate predictions. Data deficiency can be especially problematic for rare, elusive, and quiet animals. Methods that utilize complex architectures and non-linear transformations can often be viewed as “black box” and difficult to interpret at first. However, there are some methods that can be used to retrace the steps of the model and create a pathway of understanding for the results that can help interpretability. AI also requires supervision. While AI methods can operate autonomously, oversight and evaluation are always necessary to validate their reliability in their application.  Lastly, there are also concerns about the use of AI (particularly Large Language Models) in scientific writing, but that’s a whole separate beast. 

With careful consideration, AI can be a powerful method for addressing the unique and challenging problems in marine mammal research. 

Using AI to find dinner

Last fall, I wrote a blog post to introduce my project that involves looking at echograms from the past 8 years of GRANITE effort to characterize prey availability within our study region of the Oregon coast. To automate the process of finding zooplankton swarms in 8 years of echosounder data, I’m planning to utilize deep learning methods to look for structures in our echogram that look like mysid swarms. Instead of reviewing over 500 hours of echosounder data to manually identify mysid swarms (which may produce biased or inaccurate results from human error), I can apply AI methods to process the echogram data with speed and consistent rules. I’ll specifically be using image segmentation, which can fall under any of the AI, machine learning, or deep learning umbrellas depending on the specific algorithms used. 

Another way AI can come into my project is after I gather the mysid swarm data from the image segmentation. While the exact structure of this resulting relative zooplankton abundance data will influence how I can use it, I could combine these prey data at a given place and time with a suite of environmental parameters to make predictions about the health and behavior of PCFG gray whales. This type of analysis could involve models that fall within AI and machine learning similar to the Boosted Regression Trees used by GEMM Labs postdoc, Dawn Barlow. Barlow et al. (2020) used Boosted Regression Trees to test the predictive relationships between oceanographic variables, relative krill abundance, and blue whale presence. Based on that work, Barlow et al. (2022) was able to develop a forecasting model based on these relationships to predict where blue whales will be in New Zealand’s South Taranaki Bight (read more about this conservation tool here!).

Hopefully by now you’ve gained a better sense of what AI actually is and its application in marine mammal ecology. AI is a powerful tool and has its value, but is not always a substitute for more established methods. By carefully integrating AI methodologies with other techniques, we can leverage the strengths of both and enhance existing approaches. The GEMM Lab aims to use AI methods to observe and understand the intricacies of whale ecology more accurately and efficiently to ultimately support effective conservation strategies.

References

  1. Rubbens, P., Brodie, S., Cordier, T., Destro Barcellos, D., Devos, P., Fernandes-Salvador, J.A., Fincham, J.I., Gomes, A., Handegard, N.O., Howell, K., Jamet, C., Kartveit, K.H., Moustahfid, H., Parcerisas, C., Politikos, D., Sauzède, R., Sokolova, M., Uusitalo, L., Van den Bulcke, L., van Helmond, A.T.M., Watson, J.T., Welch, H., Beltran-Perez, O., Chaffron, S., Greenberg, D.S., Kühn, B., Kiko, R., Lo, M., Lopes, R.M., Möller, K.O., Michaels, W., Pala, A., Romagnan, J.-B., Schuchert, P., Seydi, V., Villasante, S., Malde, K., Irisson, J.-O., 2023. Machine learning in marine ecology: an overview of techniques and applications. ICES Journal of Marine Science 80, 1829–1853. https://doi.org/10.1093/icesjms/fsad100
  2. Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd ed.). Stanford, CA: Stanford University.
  3. Slonimer, A.L., Dosso, S.E., Albu, A.B., Cote, M., Marques, T.P., Rezvanifar, A., Ersahin, K., Mudge, T., Gauthier, S., 2023. Classification of Herring, Salmon, and Bubbles in Multifrequency Echograms Using U-Net Neural Networks. IEEE Journal of Oceanic Engineering 48, 1236–1254. https://doi.org/10.1109/JOE.2023.3272393
  4. Viquerat, S., Waluda, C.M., Kennedy, A.S., Jackson, J.A., Hevia, M., Carroll, E.L., Buss, D.L., Burkhardt, E., Thain, S., Smith, P., Secchi, E.R., Santora, J.A., Reiss, C., Lindstrøm, U., Krafft, B.A., Gittins, G., Dalla Rosa, L., Biuw, M., Herr, H., 2022. Identifying seasonal distribution patterns of fin whales across the Scotia Sea and the Antarctic Peninsula region using a novel approach combining habitat suitability models and ensemble learning methods. Frontiers in Marine Science 9.
  5. Patton, P.T., Cheeseman, T., Abe, K., Yamaguchi, T., Reade, W., Southerland, K., Howard, A., Oleson, E.M., Allen, J.B., Ashe, E., Athayde, A., Baird, R.W., Basran, C., Cabrera, E., Calambokidis, J., Cardoso, J., Carroll, E.L., Cesario, A., Cheney, B.J., Corsi, E., Currie, J., Durban, J.W., Falcone, E.A., Fearnbach, H., Flynn, K., Franklin, T., Franklin, W., Galletti Vernazzani, B., Genov, T., Hill, M., Johnston, D.R., Keene, E.L., Mahaffy, S.D., McGuire, T.L., McPherson, L., Meyer, C., Michaud, R., Miliou, A., Orbach, D.N., Pearson, H.C., Rasmussen, M.H., Rayment, W.J., Rinaldi, C., Rinaldi, R., Siciliano, S., Stack, S., Tintore, B., Torres, L.G., Towers, J.R., Trotter, C., Tyson Moore, R., Weir, C.R., Wellard, R., Wells, R., Yano, K.M., Zaeschmar, J.R., Bejder, L., 2023. A deep learning approach to photo–identification demonstrates high performance on two dozen cetacean species. Methods in Ecology and Evolution 14, 2611–2625. https://doi.org/10.1111/2041-210X.14167
  6. https://happywhale.com/whaleid
  7. https://www.flukebook.org/
  8. Quick, N.J., Isojunno, S., Sadykova, D., Bowers, M., Nowacek, D.P., Read, A.J., 2017. Hidden Markov models reveal complexity in the diving behaviour of short-finned pilot whales. Sci Rep 7, 45765. https://doi.org/10.1038/srep45765
  9. Lennox, R.J., Engler-Palma, C., Kowarski, K., Filous, A., Whitlock, R., Cooke, S.J., Auger-Méthé, M., 2019. Optimizing marine spatial plans with animal tracking data. Can. J. Fish. Aquat. Sci. 76, 497–509. https://doi.org/10.1139/cjfas-2017-0495
  10. Barlow, D.R., Bernard, K.S., Escobar-Flores, P., Palacios, D.M., Torres, L.G., 2020. Links in the trophic chain: modeling functional relationships between in situ oceanography, krill, and blue whale distribution under different oceanographic regimes. Marine Ecology Progress Series 642, 207–225. https://doi.org/10.3354/meps13339

Sonar savvy: using echo sounders to characterize zooplankton swarms

By Natalie Chazal, PhD student, OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

I’m Natalie Chazal, the GEMM Lab’s newest PhD student! This past spring I received my MS in Biological and Agricultural Engineering with Dr. Natalie Nelson’s Biosystems Analytics Lab at North Carolina State University. My thesis focused on using shellfish sanitation datasets to look at water quality trends in North Carolina and to forecast water quality for shellfish farmers in Florida. Now, I’m excited to be studying gray whales in the GEMM Lab!

Since the beginning of the Fall term, I’ve jumped into a project that will use our past 8 years of sonar data collected using a Garmin echo sounder during the GRANITE project work with gray whales off the Newport, OR coast. Echo sounder data is commonly used recreationally to detect bottom depth and for finding fish and my goal is to use these data to assess relative prey abundance at gray whale sightings over time and space. 

There are also scientific grade echo sounders that are built to be incredibly precise and very exact in the projection and reception of the sonar pulses. Both types of echosounders can be used to determine the depth of the ocean floor, structures within the water column, and organisms that are swimming within the sonar’s “cone” of acoustic sensing. The precision and stability of the scientific grade equipment allows us to answer questions related to the specific species of organisms, the substrate type at the sea floor, and even animal behavior. However, scientific grade echo sounders can be expensive, overly large for our small research vessel, and require expertise to operate. When it comes to generalists, like gray whales, we can answer questions about relative prey abundances without the use of such exact equipment (Benoit-Bird 2016; Brough 2019). 

While there are many variations of echo sounders that are specific to their purpose, commercially available, single beam echo sounders generally function in the same way (Fig. 1). First, a “ping” or short burst of sound at a specific frequency is produced from a transducer. The ping then travels downward and once it hits an object, some of the sound energy bounces off of the object and some moves into the object. The sound that bounces off of the object is either reflected or scattered. Sound energy that is either reflected or scattered back in the direction of the source is then received by the transducer. We can figure out the depth of the signal using the amount of travel time the ping took (SeaBeam Instruments 2000).

Figure 1. Diagram of how sound is scattered, reflected, and transmitted in marine environments (SeaBeam Instruments, 2000).

The data produced by this process is then displayed in real-time, on the screen on board the boat. Figure 2 is an example of the display that we see while on board RUBY (the GEMM Lab’s rigid-hull inflatable research boat): 

Figure 2. Photo of the echo sounder display on board RUBY. On the left is a map that is used for navigation. On the right is the real time feed where we can see the ocean bottom shown as the bright yellow area with the distinct boundary towards the lower portion of the screen. The more orange layer above that, with the  more “cloudy” structure  is a mysid swarm.

Once off the boat, we can download this echo sounder data and process it in the lab to recreate echograms similar to those seen on the boat. The echograms are shown with the time on the x-axis, depth on the y-axis, and are colored by the intensity of sound that was returned (Fig. 3). Echograms give us a sort of picture of what we see in the water column. When we look at these images as humans, we can infer what these objects are, given that we know what habitat we were in. Below (Fig. 3) are some example classifications of different fish and zooplankton swarms and what they look like in an echogram (Kaltenberg 2010).

Figure 3. Panel of echogram examples, from Kaltenberg 2010, for different fish and zooplankton aggregations that have been classified both visually (like we do in real time on the boat) as well as statistically (which we hope to do with the mysid aggregations). 

For our specific application, we are going to focus on characterizing mysid swarms, which are considered to be the main prey target of PCFG whales in our study area. With the echograms generated by the GRANITE fieldwork, we can gather relative mysid swarm densities, giving us an idea of how much prey is available to foraging gray whales. Because we have 8 years of GRANITE echosounder data, with 2,662 km of tracklines at gray whale sightings, we are going to need an automated process. This demand is where image segmentation can come in! If we treat our echograms like photographs, we can train models to identify mysid swarms within echograms, reducing our echogram processing load. Automating and standardizing the process can also help to reduce error. 

We are planning to utilize U-Nets, which are a method of image segmentation where the image goes through a series of compressions (encoders) and expansions (decoders), which is common when using convolutional neural nets (CNNs) for image segmentation. The encoder is generally a pre-trained classification network (CNNs work very well for this) that is used to classify pixels into a lower resolution category. The decoder then takes the low resolution categorized pixels and reprojects them back into an image to get a segmented mask. What makes U-Nets unique is that they re-introduce the higher resolution encoder information back into the decoder process through skip connections. This process allows for generalizations to be made for the image segmentation without sacrificing fine-scale details (Brautaset 2020; Ordoñez 2022; Slonimer 2023; Vohra 2023).

Figure 4. Diagram of the encoder, decoder architecture for U-Nets used in biomedical image segmentation. Note the skip connections illustrated by the gray lines connecting the higher resolution image information on the left, with the decoder process on the right (Ronneberger 2015)

What we hope to get from this analysis is an output image that provides us only the parts of the echogram that contain mysid swarms. Once the mysid swarms are found within the echograms, we can use both the intensity and the size of the swarm in the echogram as a proxy for the relative abundance of gray whale prey. We plan to quantify these estimates across multiple spatial and temporal scales, to link prey availability to changing environmental conditions and gray whale health and distribution metrics. This application is what will make our study particularly unique! By leveraging the GRANITE project’s extensive datasets, this study will be one of the first studies that quantifies prey variability in the Oregon coastal system and uses those results to directly assess prey availability on the body condition of gray whales. 

However, I have a little while to go before the data will be ready for any analysis. So far, I’ve been reading as much as I can about how sonar works in the marine environment, how sonar data structures work, and how others are using recreational sonar for robust analyses. There have been a few bumps in the road while starting this project (especially with disentangling the data structures produced from our particular GARMIN echosounder), but my new teammates in the GEMM Lab have been incredibly generous with their time and knowledge to help me set up a strong foundation for this project, and beyond. 

References

  1. Kaltenberg A. (2010) Bio-physical interactions of small pelagic fish schools and zooplankton prey in the California Current System over multiple scales. Oregon State University, Dissertation. https://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/z890rz74t
  2. SeaBeam Instruments. (2000) Multibeam Sonar Theory of Operation. L-3 Communications, East Walpole MA. https://www3.mbari.org/data/mbsystem/sonarfunction/SeaBeamMultibeamTheoryOperation.pdf
  3. Benoit-Bird K., Lawson G. (2016) Ecological insights from pelagic habitats acquired using active acoustic techniques. Annual Review of Marine Science. https://doi.org/10.1146/annurev-marine-122414-034001
  4. Brough T., Rayment W., Dawson S. (2019) Using a recreational grade echosounder to quantify the potential prey field of coastal predators. PLoS One. https://doi.org/10.1371/journal.pone.0217013
  5. Brautaset O., Waldeland A., Johnsen E., Malde K., Eikvil L., Salberg A, Handegard N. (2020) Acoustic classification in multifrequency echosounder data using deep convolutional neural networks. ICES Journal of Marine Science 77, 1391–1400. https://doi.org/10.1093/icesjms/fsz235
  6. Ordoñez A., Utseth I., Brautaset O., Korneliussen R., Handegard N. (2022) Evaluation of echosounder data preparation strategies for modern machine learning models. Fisheries Research 254, 106411. https://doi.org/10.1016/j.fishres.2022.106411
  7. Slonimer A., Dosso S., Albu A., Cote M., Marques T., Rezvanifar A., Ersahin K., Mudge T., Gauthier S., (2023) Classification of Herring, Salmon, and Bubbles in Multifrequency Echograms Using U-Net Neural Networks. IEEE Journal of Oceanic Engineering 48, 1236–1254. https://doi.org/10.1109/JOE.2023.3272393
  8. Ronneberger O., Fischer P., Brox T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. https://doi.org/10.48550/arXiv.1505.04597