First Flight

By Lindsay Wickman, Postdoc, OSU Department of Fisheries, Wildlife, and Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

I’ve had the privilege of attending several marine mammal surveys aboard ships at sea, but I had never surveyed for marine mammals from the air. So, when given the opportunity to participate in ongoing aerial surveys off the Oregon Coast with the US Coastguard’s helicopter fleet, I enthusiastically said yes. As Craig Hayslip, a Faculty Research Assistant with the Marine Mammal Institute, prepared me for my first helicopter survey, I was all excitement and no nerves. That is, until he explained the seating arrangement.

“There are two types of helicopters you’ll be flying on, and because of the seating arrangement in the Jayhawk, we fly with the door open when surveying for whales – it’s the only way to get a sufficient view,” Craig casually explained. I stared at the iPad I would use for recording data and imagined it flying through that open door and toward the sea, while I looked on flustered and helpless. Sensing my worry, Craig quickly showed me a set of straps that attached to the iPad, so it could be secured to one of my legs.

In addition to ensuring the iPad stayed in the aircraft, the straps also meant my hands would still be free to handle the camera (to aid in species identification), and a small tool called a geometer (developed by Pi Techology). By lining up the whale sighting in the sight of the geometer, the observer can record the angle between the aircraft and the sighting. Since we also know the height of the helicopter (we fly at a constant altitude of 500 feet), this angle can be used to calculate horizontal distance from the aircraft, allowing an accurate location to be estimated for each sighting.

My first flight was from Warrenton, Oregon, a four-hour drive north from the Hatfield Marine Science Center in Newport. Once at the airport, our first stop was to head to the flight operations office (a.k.a. “Ops”), who set us up with proper clothing and headgear for the flight. As we checked in, rock music played on a speaker while uniformed Coast Guardsmen serviced a helicopter in the hangar. I started to feel like a cool insider, until I clumsily donned the canvas flight suit and tried on several helmets. Suddenly several pounds heavier, all my movements became very awkward.

Lindsay outside the hangar wearing flight gear, in front of the survey’s helicopter. Photo by Craig Hayslip.

After my safety briefing, the entire crew gathered for a pre-flight meeting. We discussed weather conditions, did a wellness check, and discussed the flight’s mission. The conversation also included a brief overview of our scientific aims – why exactly were we looking for whales?

Craig briefly described the research project we were contributing to, titled Overlap Predictions About Large whales (OPAL). The main goal of this project is to better understand the overlap between whales and fisheries, with the aim of reducing entanglement risk. Fishing methods that use fixed, vertical lines in the water column, like the Dungeness crab fishery, can entangle whales as they migrate and feed along Oregon’s coastline. Since reports of whale entanglements have increased on the West Coast in the last 10 years, managing this threat is essential to ensure both the health of whale populations and the stability of Oregon’s crab fishery. Preventing these entanglements requires an understanding of where whales are distributed along the coast, as well as the times of year overlap with fisheries is most likely to occur. The OPAL project isn’t just mapping whale sightings, though. By using models to correlate whale sightings with oceanographic conditions, OPAL is also aiming to predict where whales are likely to occur.

After explaining the mission, the crew had to reach a consensus on both the level of “risk” in the mission and its level of “gain.” For a whale survey flight, risk was deemed low, with medium gain. While I initially felt mild offence that our scientific work was deemed to have just “medium” gain, I quickly reminded myself that when the crew is not flying scientists around, they are literally saving human lives. It was also a reminder that our whale surveys could easily be interrupted if necessary – Craig had mentioned several instances where flights were diverted to assist in rescue or medical emergencies.

With the briefing over, each of us had to consent to the flight plan by saying, “I accept this mission.” I’d heard this phrase from secret agents and soldiers in movies, but never from a marine scientist. I felt out of place saying them at first, but the words undeniably helped me establish a self-assured confidence I would give the survey my 100%.

Finally, it was time to head out of the hangar and to the aircraft. With both a pair of earplugs and my flight helmet on, the whirring of the blades was just a soft hum. I couldn’t hear speech, so we all relied on hand signals to communicate until our headsets were connected to the aircraft. The crew helped make sure I correctly put on my seatbelt harness, which had not just one, but five buckles. While I still felt some mild concern for the iPad strapped to my leg, at least I knew I wouldn’t fall out.

Lindsay holds up the geometer during the flight. Photo by Craig Hayslip.

Craig helped ensure I had all the equipment set up properly: the iPad’s survey program, the GPS tracking, and the computer recording the geometer’s measurements. Soon after, the helicopter slowly rose, hovering above the runway, before turning and heading towards the coast at speed. My stomach dropped slightly, my ears popped, and cold air rushed through the open door. I looked out at the Columbia River as it stretched toward the coastline and out to sea, and I couldn’t stop smiling.

A rainbow mid-air. Photo by Craig Hayslip.

As we approached the ocean, my attention shifted back to the mission, and I started scanning the surface for whale blows. With the large helmet on, I noticed the camera and geometer were much more difficult to use, so I also made “practice sightings” of passing boats and buoys. It didn’t take long before my first real whale sighting though – two gray whales (Eschrichtius robustus). Over the next two hours, I saw four more gray whales, and six more whales I was unable to identify due to distance. With each sighting, I had to act fast to make each geometer recording. The helicopter travels at a speed of 90 knots and whales can disappear soon after surfacing.

Two hours of flying with the door open meant my nose was running and my typing skills were worsening due to cold fingers. As exciting as it was to spot whales from the air, I was a little relieved when we arrived back at the airport and I could warm back up. Luckily, my nightmare of losing an iPad from the helicopter did not come true, and I was returning home with another survey to add to over 200 (and counting!) helicopter surveys completed for the OPAL project. Four different flights covering different parts of the Oregon coast are completed each month, so I know I have more flights to look forward to. After a successful first mission, I feel ready to take on my next flight.

The four flight routes completed monthly for the OPAL project. Helicopter flights are enabled through a partnership with the US Coastguard.

If you’d like to learn more about the OPAL research project, check out these past blog posts:

A Matter of Time: Adaptively Managing the Timescales of Ocean Change and Human Response

The pathway to advancing knowledge of rorqual whale distribution off Oregon

From land, sea,… and space: searching for whales in the vast ocean

The ups and downs of the ocean

Recent publications presenting findings from the first two years of OPAL include:

Derville, S., Barlow, D. R., Hayslip, C., & Torres, L. G. (2022). Seasonal, Annual, and Decadal Distribution of Three Rorqual Whale Species Relative to Dynamic Ocean Conditions Off Oregon, USA. Frontiers in Marine Science, 9. https://doi.org/10.3389/fmars.2022.868566

Derville, S., Buell, T. v., Corbett, K. C., Hayslip, C., & Torres, L. G. (2023). Exposure of whales to entanglement risk in Dungeness crab fishing gear in Oregon, USA, reveals distinctive spatio-temporal and climatic patterns. Biological Conservation, 281. https://doi.org/10.1016/j.biocon.2023.10998

Did you enjoy this blog? Want to learn more about marine life, research, and conservation? Subscribe to our blog and get a weekly message when we post a new blog. Just add your name and email into the subscribe box below.

Loading

Significant others? Thinking beyond p-values in science

By Natalie Chazal, PhD student, OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

Scientific inquiry relies on quantifying how certain we are of the differences we see in observations. This means that we must look at phenomena based on probabilities that we calculate from observed data, or data that we collect from sampling efforts. Historically, p-values have served as a relatively ubiquitous tool for assessing the strength of evidence in support of a hypothesis. However, as our understanding of statistical methods evolves, so does the scrutiny surrounding the appropriateness and interpretation of p-values. In the realm of research, the debate surrounding the use of p-values for determining statistical significance has sparked some controversy and reflection within the academic community. 

What is a p-value?

To understand the debate itself, we need to understand what a p-value is. The p-value represents the probability of obtaining a result as extreme as, or more extreme than, the observed data, under the assumption that there is no true difference or relationship between groups or variables. Traditionally, a p-value below a predetermined threshold (often 0.05) is considered statistically significant, suggesting that the observed data are unlikely (i.e., a 5% probability) to have occurred by chance alone. Many statistical tests provide p-values, which gives us a unified framework for interpretation across a range of analyses.

To illustrate this, imagine a study aimed at investigating the effects of underwater noise pollution on the foraging behavior of gray whales. Researchers collect data on the diving behavior of gray whales in both noisy and quiet regions of the ocean.

Drawings of gray whales with tags (depicted by orange shapes) in quiet areas (left) and noisy areas (right). 

In this example, the researchers hypothesize that gray whales stop foraging and ultimately change their diving behavior in response to increased marine noise pollution. The data collected from this hypothetical scenario could come from tags equipped with sensors that record diving depth, duration, and location, allowing us to calculate the exact length of time spent foraging. Data would be collected from both noisy areas (maybe near shipping lanes or industrial sites) and quiet areas (more remote regions with minimal human activity). 

To assess the significance of the differences between the two noise regimes, researchers may use statistical tests like t-tests to compare two groups. In our example, researchers use a t-test to compare the average foraging time between whales in noisy and quiet regimes. The next step would be to define hypotheses about the differences we expect to see. The null hypothesis (HN) would be that there is no difference in the average foraging time (X) between noisy and quiet areas: 

Scenario where the noisy area does not elicit a behavioral response that can be detected by the data collected by the tags (orange shapes on whales back). The lower graph shows the distribution of the data (foraging time) for the noisy and the quiet areas. The means of this data (X) are not different. 

And the alternative hypothesis (HA) would be that there is a difference between the noisy and quiet areas: 

Scenario where the noisy area elicits a behavioral response (swimming more towards the surface instead of foraging) that can be detected by the data collected by the tags (orange shapes on whales back). The lower graph shows the distribution of the data (foraging time) for the noisy and the quiet areas. The means of this data (X) are different with the noisy mean foraging time (pink) being lower than the quiet mean foraging time (blue).

For now, we will skip over the nitty gritty of a t-test and just say that the researchers get a “t-score” that says whether or not there is a difference in the means (X) of the quiet and noisy areas. A larger t-score means that there is a difference in the means whereas a smaller t-score would indicate that the means are more similar. This t-score comes along with a p-value. Let’s say we get a t-score (green dot) that is associated with a p-value of 0.03 shown as the yellow area under the curve: 

The t-score is a test statistic that tells us how different the means of our observed data groups are from each other (green dot). The area under the t-distribution that is above the t-score is the p-value (yellow shaded area).

A p-value of 0.03 means that there is a 3% probability of obtaining these observed differences in foraging time between noisy and quiet areas purely by chance, which assumes that the null hypothesis is true (that there is no difference). We usually compare this p-value to a threshold value to say whether this finding is significant. We set this threshold before looking at the results of the test. If the threshold is above our value, like 0.05, then we can “reject the null hypothesis” and conclude that there is a significant difference in foraging time between noisy and quiet areas (green check mark scenario). On the flip-side, if the threshold that we set before our results is too low (0.01), then we will “fail to reject the null hypothesis” and conclude that there was no significant difference in foraging time between noisy and quiet areas (red check mark scenario). The reason that we don’t ever “accept the null” is because we are testing an alternative hypothesis with observations and if those observations are consistent with the null rather than the alternative, this is not evidence for the null because it could be consistent with a different alternative hypothesis that we are not yet testing for.

When our pre-set threshold to determine significance is above or greater than our p-value that was calculated we have enough evidence to ‘reject the null hypothesis’ (left figure) whereas if our p-value is lower or smaller than our calculated p-value, then we ‘fail to reject the null hypothesis’ (right figure).

In this example, the use of p-values helps the researchers quantify the strength of evidence for their hypothesis and determine whether the observed differences in gray whale behavior are likely to be meaningful or merely due to chance. 

The Debate

Despite its widespread use, the reliance on p-values has been met with criticism. Firstly, because p-values are so ubiquitous, it can be easy to calculate them with or without enough critical thinking or interpretation. This critical thinking should include an understanding of what is biologically relevant and avoid the trap of using binary language like significant or non-significant results instead of looking directly at the uncertainty of your results. One of the other most common misconceptions about p-values is that they can measure the direct probability of the null hypothesis being true. As amazing as that would be, in reality we can only use p-values to understand the probability of our observed data. Additionally, it’s common to conflate the significance or magnitude of the p-value with effect size (which is the strength of the relationship between the variables). You can have a small p-value for an effect that isn’t very large or meaningful, especially if you have a large sample size. Sample size is an important metric to report. Larger number of samples generally means more precise estimates, higher statistical power, increased generalizability, and higher possibility for replication.

Furthermore, in studies that require multiple comparisons (i.e. multiple statistical analyses are done in a single study), there is an increased likelihood of observing false positives because each test introduces a chance of obtaining a significant result by random variability alone. In p-value language, a “false positive” is when you say something is significant (below your p-value threshold) when it actually is not, and a “false negative” is when you say something is not significant (above the p-value threshold) when it actually is. So, in terms of multiple comparisons, if there are no adjustments made for the increased risk of false positives, this can potentially lead to inaccurate conclusions of significance.

In our example using foraging time in gray whales, we didn’t consider the context of our findings. To make this a more reliable study, we have to consider factors like the number of whales tagged (sample size!), the magnitude of noise near the tagged whales, other variables in the environment (e.g. prey availability) that could affect our results, and the ecological significance in the difference in foraging time that was found. To make robust conclusions, we need to carefully build hypotheses and study designs that will answer the questions we seek. We must then carefully choose the statistical tests that we use and explore how our data align with the assumptions that these tests make. It’s essential to contextualize our results within the bounds of our study design and broader ecological system. Finally, performing sensitivity analyses (e.g. running the same tests multiple times on slightly different datasets) ensures that our results are stable over a variety of different model parameters and assumptions. 

In the real world, there have been many studies done on the effects of noise pollution on baleen whale behavior that incorporate multiple sources of variance and bias to get robust results that show behavioral responses and physiological consequences to anthropogenic sound stressors (Melcón et al. 2012, Blair et al. 2016, Gailey et al. 2022, Lemos et al. 2022).

Moving Beyond P-values

There has been growing interest in reassessing the role of p-values in scientific inference and publishing. Scientists appreciate p-values because they provide one clear numeric threshold to determine significance of their results. However, the reality is more complicated than this binary approach. We have to explore the uncertainty around these estimates and test statistics (e.g. t-score) and what they represent ecologically. One avenue to explore might be focusing more on effect sizes and confidence intervals as more informative measures of the magnitude and precision of observed effects. There has also been a shift towards using Bayesian methods, which allow for the incorporation of prior knowledge and a more nuanced quantification of uncertainty.

Bayesian methods in particular are a leading alternative to p-values because instead of looking at how likely our observations are given a null hypothesis, we get a direct probability of the hypothesis given our data. For example, we can use Bayes factor for our noisy vs quiet gray whale behavioral t-test (Johnson et al. 2023). Bayes factor measures the likelihood of the data being observed for each hypothesis separately (instead of assuming the null hypothesis is true) so if we calculate a Bayes factor of 3 for the alternative hypothesis (HA), we could directly say that it is 3 times more likely for there to be decreased foraging time in a noisy area than for there to be no difference in the noisy vs quiet group. But that is just one example of Bayesian methods at work. The GEMM lab uses Bayesian methods in many projects from Lisa’s spatial capture-recapture models (link to blog) and Dawn’s blue whale abundance estimates (Barlow et al. 2018) to quantifying uncertainty associated with drone photogrammetry data collection methods in KC’s body size models (link to blog). 

Ultimately, the debate surrounding p-values highlights the necessity of nuanced and transparent approaches to statistical inference in scientific research. Rather than relying solely on arbitrary thresholds, researchers can consider the context, relevance, and robustness of their findings. From justifying our significance thresholds to directly describing parameters based on probability, we have increasingly powerful tools to improve the methodological rigor of our studies. 

References

Agathokleous, E., 2022. Environmental pollution impacts: Are p values over-valued? Science of The Total Environment 850, 157807. https://doi.org/10.1016/j.scitotenv.2022.157807

Barlow, D.R., Torres, L.G., Hodge, K.B., Steel, D., Baker, C.S., Chandler, T.E., Bott, N., Constantine, R., Double, M.C., Gill, P., Glasgow, D., Hamner, R.M., Lilley, C., Ogle, M., Olson, P.A., Peters, C., Stockin, K.A., Tessaglia-Hymes, C.T., Klinck, H., 2018. Documentation of a New Zealand blue whale population based on multiple lines of evidence. Endangered Species Research 36, 27–40. https://doi.org/10.3354/esr00891

Blair, H.B., Merchant, N.D., Friedlaender, A.S., Wiley, D.N., Parks, S.E., 2016. Evidence for ship noise impacts on humpback whale foraging behaviour. Biol Lett 12, 20160005. https://doi.org/10.1098/rsbl.2016.0005

Brophy, C., 2015. Should ecologists be banned from using p-values? Journal of Ecology Blog. URL https://jecologyblog.com/2015/03/06/should-ecologists-be-banned-from-using-p-values/ (accessed 4.19.24).

Castilho, L.B., Prado, P.I., 2021. Towards a pragmatic use of statistics in ecology. PeerJ 9, e12090. https://doi.org/10.7717/peerj.12090

Gailey, G., Sychenko, O., Zykov, M., Rutenko, A., Blanchard, A., Melton, R.H., 2022. Western gray whale behavioral response to seismic surveys during their foraging season. Environ Monit Assess 194, 740. https://doi.org/10.1007/s10661-022-10023-w

Halsey, L.G., 2019. The reign of the p-value is over: what alternative analyses could we employ to fill the power vacuum? Biology Letters 15, 20190174. https://doi.org/10.1098/rsbl.2019.0174

Johnson, V.E., Pramanik, S., Shudde, R., 2023. Bayes factor functions for reporting outcomes of hypothesis tests. Proceedings of the National Academy of Sciences 120, e2217331120. https://doi.org/10.1073/pnas.2217331120

Lemos, L.S., Haxel, J.H., Olsen, A., Burnett, J.D., Smith, A., Chandler, T.E., Nieukirk, S.L., Larson, S.E., Hunt, K.E., Torres, L.G., 2022. Effects of vessel traffic and ocean noise on gray whale stress hormones. Sci Rep 12, 18580. https://doi.org/10.1038/s41598-022-14510-5

LU, Y., BELITSKAYA-LEVY, I., 2015. The debate about p-values. Shanghai Arch Psychiatry 27, 381–385. https://doi.org/10.11919/j.issn.1002-0829.216027

Melcón, M.L., Cummins, A.J., Kerosky, S.M., Roche, L.K., Wiggins, S.M., Hildebrand, J.A., 2012. Blue Whales Respond to Anthropogenic Noise. PLOS ONE 7, e32681. https://doi.org/10.1371/journal.pone.0032681

Murtaugh, P.A., 2014. In defense of P values. Ecology 95, 611–617. https://doi.org/10.1890/13-0590.1

Vidgen, B., Yasseri, T., 2016. P-Values: Misunderstood and Misused. Front. Phys. 4. https://doi.org/10.3389/fphy.2016.00006

Baleen analyses reveals patterns in foraging ecology and stress physiology in gray whales prior to death.

Dr. Alejandro A. Fernández Ajó, Postdoctoral Scholar, Marine Mammal Institute – OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna (GEMM) Lab.

The Eastern North Pacific (ENP) gray whale population has experienced at least two recorded Unusual Mortality Events (UMEs), from 1999–2000 and from 2019 to 2024, during which many gray whales stranded along the Pacific coast from northern Mexico to the Alaskan Arctic, USA (Martínez-Aguilar et al., 2019; Urbán, 2020). Several factors have been considered as possible causes for the high number of gray whale’s strandings, including variation in Arctic prey availability and the duration of their feeding season caused by the timing of sea ice formation and breakup (Stewart et al., 2023), starvation, anthropogenically derived toxicants, biotoxins, infectious diseases, parasites, fisheries interactions, and ship strikes (F. Gulland et al., 2005). In the most recent UME, many of the stranded whales showed signs of emaciation, indicating malnutrition as a causal factor of death (Christiansen et al., 2021; Torres et al., 2022). While the poor condition of many of the stranded whales supports the idea of starvation as a cause for these mortalities, the underlying causes of malnutrition are unknown, and it is also unclear whether the whales’ decline in body condition was rapid or gradual.

Figure 1. Gray whale with baleen exposed. Photo: GEMM Lab  NOAA/NMFS permit #16111.

Large whales face a multitude of stressors in their environment, ranging from ocean noise to contaminants, climate change, and prey shifts. Understanding how individual whales respond to these disturbances is crucial for assessing potential impacts on the population as a whole. However, monitoring the health parameters and vital rates of whales presents significant challenges due to their large size, mobility, and the vast ranges of their marine habitat. Studying stranded whales can provide valuable insights into health risks, disease susceptibility, and the impacts of pollutants and other stressors on whale populations, thus informing conservation strategies (see post). Nonetheless, the quality of information obtained from necropsies heavily relies on the timeliness of stranding reports, as decomposition begins immediately after death, limiting detailed investigations into the cause of death. Therefore, establishing a robust network capable of promptly reporting and addressing stranding events is essential (Gulland & Stockin, 2020). An effective network involves having well-trained staff, proper infrastructure, sufficient funding, and the expertise and tools necessary to gather and analyze data and samples to infer their health and causes of mortality.

During my doctoral dissertation, I worked to develop and ground truth the endocrine analyses of whale baleen as a novel sample type that can be used for retrospective assessments of the whale’s physiology (see my previous post & post). Baleen, the filter-feeding apparatus of mysticetes whales (Figure 1), consists of long fringed plates of keratinized tissue that grow continuously and slowly downward from the whale’s upper jaw. These plates are routinely collected at necropsies; and unlike other tissue types, they are durable and have minimum storage requirements; they can be preserved dry at room temperature, allowing for the analysis of both historical and current whale populations. Moreover, while most sample types used for studying whale health and physiology provide a single time-point measure of current circulating hormone levels (e.g., skin or respiratory vapor) or hold integrated information from the previous few hours or days (e.g., urine and feces), baleen tissue provides a unique opportunity for retrospective and longitudinal analyses of multiple biological parameters of the individual during the time that the tissue was grown, i.e., months to years prior to death, helping to describe the whale’s physiology, migration patterns, and exposure to pollutants (see my previous post).

In our recent study, “A longitudinal study of endocrinology and foraging ecology of subadult gray whales prior to death based on baleen analysis”, published in the journal General and Comparative Endocrinology, we examine isotope and hormone levels in the baleen of five young gray whales stranded in central Oregon during the most recent UME. Our primary objectives were to retrospectively examine the hormone and isotopic profiles of the individual whales prior to mortality, assess potential factors contributing to death, and identify the timing for the onset of chronic illness leading to mortality. Our analysis included tracing longitudinal changes in (1) stable isotope values in baleen (δ13C and δ15N), which allowed us to infer the baleen growth rate and assess the seasonal changes in diet and foraging location in large whales (Figure 2), along with the quantification of (2) two adrenal glucocorticoid steroids that are biomarkers for the whale’s stress response, (3) one thyroid hormone (triiodothyronine, T3) as an indicator of nutritional state, and (4) two sex hormones, progesterone and testosterone, to infer about reproductive status and sexual maturity. By integrating isotopic and hormonal methodologies, our study demonstrates how baleen analysis offers a comprehensive narrative of the endocrine and trophic ecology of individual whales over time.

Figure 2. Gray whales, like other large marine mammals that rely on built-up energy reserves, exhibit distinct seasonal shifts in their feeding habits. During summer, these whales feed at the ocean’s bottom, consuming organisms lower in the food chain, which is reflected in lower nitrogen values in their baleen (summer foraging). In winter, however, they must rely on their own fat reserves, causing an increase in nitrogen values (wintering). In this plot we can observe the oscillations in δ15N over time; this information allows us to estimate the baleen growth rate. Our results suggest that gray whale baleen holds a record of around 1.3 years of stable isotopes and hormone data prior to the time of death (Fernandez Ajo et al. 2024). The red cross in the X-axis, indicate the time of death. Gray whale illustration https://www.fisheries.noaa.gov/species/gray-whale

Our endocrine assessments revealed detailed profiles of stress-related hormones (glucocorticoids, cortisol) and thyroid hormones along the lengths of the baleen. We found increased levels of cortisol in whales that died from unknown causes, starting about eight months prior to their deaths. This suggests these whales were under prolonged stress before dying. In contrast, in the case of a whale killed acutely by a killer whale, cortisol levels were low and constant prior to death, indicating this individual was likely in good health prior to the sudden attack. In terms of thyroid activity, indicated by T3 hormone levels, we found a gradual increase over several months in the whales that died of unknown causes. This pattern is not typically expected, as stress usually suppresses thyroid function. This anomaly could suggest an adaptive response to maintain body temperature and metabolism in potentially malnourished whales. Regarding the sex hormones, as expected for this age class, we found no significant fluctuations or spikes that would indicate sexual maturity in these young whales (Figure 3).

Figure 3. Longitudinal hormone profiles in an individual gray whale that died due to unknown causes (left) and one that died acutely due to orca predation (right). Note the pronounced elevations in cortisol levels (indicative of stress) and T3 prior to death in the case of unknown cause of death, while hormone levels remained low and constant prior to death in the whale acutely killed. Sex hormones do not present any clear oscillations, indicating that these whales were likely sexually immature. The red cross in the X-axis, indicate the time of death. Killer whale (Orcinus orca) illustration https://www.fisheries.noaa.gov/species/ killer-whale

Although commercial whaling is currently banned and several whale populations show evidence of recovery, today’s whales are exposed to a variety of other human stressors that cause significant lethal and non-lethal impacts (e.g., entanglement in fishing gear, vessel strikes, shipping noise, climate change, etc.; reviewed in Thomas et al., 2016). The recovery and conservation of large whale populations is particularly important to the oceanic environment due to their key ecological role and unique biological traits (See my previous post). Our research demonstrates the strengths of using baleen as a tool for the retrospective assessments of whale endocrinology and trophic ecology. As the Eastern North Pacific gray whale population faces recurring challenges, indicated by fluctuating numbers and unusual mortality events, innovative techniques like the baleen analysis presented here, are essential to investigate the causes of mortality and inform management, helping us understand not only the immediate causes of death but also broader environmental and ecological changes affecting their survival. Broadly implementing this approach with a greater sample size of baleen collected across a larger spatial and temporal range could significantly improve our strategies for conservation and management of baleen whales.

Did you enjoy this blog? Want to learn more about marine life, research, and
conservation? Subscribe to our blog and get a weekly message when we post a new
blog. Just add your name and email into the subscribe box below.

Loading

References

Christiansen, F., Rodríguez-González, F., Martínez-Aguilar, S., Urbán, J., Swartz, S., Warick, H., Vivier, F., & Bejder, L. (2021). Poor body condition associated with an unusual mortality event in gray whales. Marine Ecology Progress Series, 658, 237–252. https://doi.org/10.3354/meps13585

Gulland, F. M. D., & Stockin, K. A. (2020). Harmonizing global strandings response. European Cetacean Society Special Publication Series.

Gulland, F., Pérez-Cortés, H., Urbán, J. R., Rojas-Bracho, L., Ylitalo, G., Weir, J., Norman, S., Muto, M., Rugh, D., Kreuder, C., & Rowles, T. (2005). Eastern North Pacific gray whale (Eschrichtius robustus) unusual mortality event, 1999-2000. U.S. Department of Commerce. NOAA Technical Memorandum. NMFS-AFSC-150., March, 33 pp. http://www.afsc.noaa.gov/publications/AFSC-TM/NOAA-TM-AFSC-150.pdf

Martínez-Aguilar, S., Mariano-Meléndez, E., López-Paz, N., Castillo-Romero, F., Zaragoza-aguilar, G. A., Rivera-Rodriguez, J., Zaragoza-Aguilar, A., Swartz, S., Viloria-Gómora, L., & Urbán, J. R. (2019). Gray whale (Eschrichtius robustus) stranding records in Mexico during the winter breeding season in 2019. Report of the International Whaling Commission. Document SC/68A/CMP/14, May.

Stewart, J. D., Joyce, T. W., Durban, J. W., Calambokidis, J., Fauquier, D., Fearnbach, H., Grebmeier, J. M., Lynn, M., Manizza, M., Perryman, W. L., Tinker, M. T., & Weller, D. W. (2023). Boom-bust cycles in gray whales associated with dynamic and changing Arctic conditions. Science, 382(6667), 207–211. https://doi.org/10.1126/science.adi1847

Torres, L. G., Bird, C. N., Rodríguez-González, F., Christiansen, F., Bejder, L., Lemos, L., Urban R, J., Swartz, S., Willoughby, A., Hewitt, J., & Bierlich, KC. (2022). Range-Wide Comparison of Gray Whale Body Condition Reveals Contrasting Sub-Population Health Characteristics and Vulnerability to Environmental Change. Frontiers in Marine Science, 9. https://doi.org/10.3389/fmars.2022.867258

Urbán, R. (2020). Gray whale stranding records in Mexico during the 2020 winter breeding season. Unpublished Paper SC/68B/CMP/13 Presented to the IWC Scientific Committee, Cambridge.

A MOSAIC of species, datasets, tools, and collaborators

By Dr. Dawn Barlow, Postdoctoral Scholar, OSU Department of Fisheries, Wildlife, and Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

Imagine you are 50 nautical miles from shore, perched on the observation platform of a research vessel. The ocean is blue, calm, and seems—for all intents and purposes—empty. No birds fly overhead, nothing disturbs the rolling swells except the occasional whitecap from a light breeze. The view through your binoculars is excellent, and in the distance, you spot a disturbance at the surface of the water. As the ship gets closer, you see splashing, and a flurry of activity emerges as a large group of dolphins leap and dive, likely chasing a school of fish. They swim along with the ship, riding the bow-wave in a brief break from their activity. Birds circle in the air above them and float on the water around them. Together with your team of observers, you rush to record the species, the number of animals, their distance to the ship, and their behavior. The research vessel carries along its pre-determined trackline, and the feeding frenzy of birds and dolphins fades off behind you as quickly as it came. You return to scanning the blue water.

Craig Hayslip and Dawn Barlow scan for marine mammals from the crow’s nest (elevated observation platform) of the R/V Pacific Storm.

The marine environment is highly dynamic, and resources in the ocean are notoriously patchy. One of our main objectives in marine ecology is to understand what drives these ephemeral hotspots of species diversity and biological activity. This objective is particularly important now as the oceans warm and shift. In the context of rapid global climate change, there is a push to establish alternatives to fossil fuels that can support society’s energy needs while minimizing the carbon emissions that are a root cause of climate change. One emergent option is offshore wind, which has become a hot topic on the West Coast of the United States in recent years. The technology has the potential to supply a clean energy source, but the infrastructure could have environmental and societal impacts of its own, depending on where it is placed, how it is implemented, and when it is operational.

Northern right whale dolphins leap into the air. Photo by Craig Hayslip.

Any development in the marine environment, including alternative energy such as offshore wind, should be undertaken using the best available scientific knowledge of the ecosystem where it will be implemented. The Marine Mammal Institute’s collaborative project, Marine Offshore Species Assessments to Inform Clean energy (MOSAIC), was designed for just this reason. As the name “MOSAIC” implies, it is all about using different tools to compile different datasets to establish crucial baseline information on where marine mammals and seabirds are distributed in Oregon and Northern California, a region of interest for wind energy development.

A MOSAIC of species

The waters of Oregon and Northern California are rich with life. Numerous cetaceans are found here, from the largest species to ever live, the blue whale, to one of the smallest cetaceans, the harbor porpoise, with many species filling in the size range in between: fin whales, humpback whales, sperm whales, killer whales, Risso’s dolphins, Pacific white-sided dolphins, northern right whale dolphins, and Dall’s porpoises, to name a few. Seabirds likewise rely on these productive waters, from the large, graceful albatrosses that feature in maritime legends, to charismatic tufted puffins, to the little Leach’s storm petrels that could fit in the palm of your hand yet cover vast distances at sea. From our data collection efforts so far, we have already documented 16 cetacean species and 64 seabird species.

A Laysan albatross glides over the water’s surface. Photo by Will Kennerley.

A MOSAIC of data and tools

Schematic of the different components of the MOSAIC project. Graphic created by Solene Derville.

Through the four-year MOSAIC project, we are undertaking two years of visual surveys and passive acoustic monitoring from Cape Mendocino to the mouth of the Columbia River on the border of Oregon and Washington and seaward to the continental slope. Six comprehensive surveys for cetaceans and seabirds are being conducted aboard the R/V Pacific Storm following a carefully chosen trackline to cover a variety of habitats, including areas of interest to wind energy developers.

These dedicated surveys are complemented by additional surveys conducted aboard NOAA research vessels during collaborative expeditions in the Northern California Current, and ongoing aerial surveys in partnership with the United States Coast Guard through the GEMM Lab’s OPAL project. Three bottom-mounted hydrophones were deployed in August 2022, and are recording cetacean vocalizations and the ambient soundscape, and these recordings will be complemented by acoustic data that is being collected continuously by the Oceans Observing Initiative. In addition to these methods to collect broad-scale species distribution information, concurrent efforts are being conducted via small boats to collect individual identification photographs of baleen whales and tissue biopsy samples for genetic analysis. Building on the legacy of satellite tracking here at the Marine Mammal Institute, the MOSAIC project is breathing new life into tag data from large whales to assess movement patterns over many years and determine the amount of time spent within our study area.

A curious fin whale approaches the R/V Pacific Storm during one of the visual surveys. Photo by Craig Hayslip.
Survey tracklines extending between the Columbia River and Cape Mendocino, designed for the MOSAIC visual surveys aboard the R/V Pacific Storm.

The resulting species occurrence data from visual surveys and acoustic monitoring will be integrated to develop Species Distribution Models for the many different species in our study region. Identification photographs of individual baleen whales, DNA profiles from whale biopsy samples, and data from satellite-tagged whales will provide detailed insight into whale population structure, behavior, and site fidelity (i.e., how long they typically stay in a given area), which will add important context to the distribution data we collect through the visual surveys and acoustic monitoring. The models will be implemented to produce maps of predicted species occurrence patterns, describing when and where we expect different cetaceans and seabirds to be under different environmental conditions.

With five visual surveys down, the MOSAIC team is gearing up for one final survey this month. The hydrophones will be retrieved this summer. Then, with data in-hand, the team will dive deep into analysis.

A MOSAIC of collaborators

The MOSAIC-4 team waves from the crow’s nest (observation platform) of the R/V Pacific Storm. Photo by Craig Hayslip.

The collaborative MOSAIC team brings together a diverse set of tools. The depth of expertise here at the Marine Mammal Institute spans a broad range of disciplines, well-positioned to provide robust scientific knowledge needed to inform alternative energy development in Oregon and Northern California waters.  

I have had the pleasure of participating in three of the six surveys aboard the R/V Pacific Storm, including leading one as Chief Scientist, and have collected visual survey data aboard NOAA Ship Bell M. Shimada and from United States Coast Guard helicopters over the years that will be incorporated in the MOSAIC of datasets for the project. This ecosystem is one that I feel deeply connected to from time spent in the field. Now, I am thrilled to dive into the analysis, and will lead the modeling of the visual survey data and the integration of the different components to produce species distribution maps for cetaceans and seabirds our study region.

This project is funded by the United States Department of Energy. The Principal Investigator is the Institute’s Director Dr. Lisa Ballance, and Co-Principal Investigators include Scott Baker, Barbara Lagerquist, Rachael Orben, Daniel Palacios, Kate Stafford, and Leigh Torres of the Marine Mammal Institute; John Calambokidis of the Cascadia Research Collective; and Elizabeth Becker of ManTech International Corp. For more information, please visit the project website, and stay tuned for updates as we enter the analysis phase.

Did you enjoy this blog? Want to learn more about marine life, research, and conservation? Subscribe to our blog and get a weekly alert when we make a new post! Just add your name into the subscribe box below!

Loading

An ‘X’travaganza! Introducing the Marine Mammal Institute’s Center of Drone Excellence (CODEX)

Dr. KC Bierlich, Postdoctoral Scholar, OSU Department of Fisheries, Wildlife, & Conservation Sciences, Geospatial Ecology of Marine Megafauna Lab

Drones are becoming more and more prevalent in marine mammal research, particularly for non-invasively obtaining morphological measurements of cetaceans via photogrammetry to identify important health metrics (see this and this previous blog). For example, the GEMM Lab uses drones for the GRANITE Project to study Pacific Coast Feeding Group (PCFG) gray whales and we have found that PCFG whales are skinnier and morphologically shorter with smaller skulls and flukes compared to the larger Eastern North Pacific (ENP) population. The GEMM Lab has also used drones to document variation in body condition across years and within a season, to diagnose pregnancy, and even measure blowholes.

While drone-based photogrammetry can provide major insight into cetacean ecology, several drone systems and protocols are used across the scientific community in these efforts, and no consistent method or centralized framework is established for quantifying and incorporating measurement uncertainty associated with these different drones. This lack of standardization restricts comparability across datasets, thus hindering our ability to effectively monitor populations and understand the drivers of variation (e.g., pollution, climate change, injury, noise).

We are excited to announce the Marine Mammal Institute’s (MMI) Center of Drone Excellence (CODEX), which focuses on developing analytical methods for using drones to non-invasively monitor marine mammal populations. CODEX is led by GEMM Lab member’s KC Bierlich, Leigh Torres, and Clara Bird and consists of other team members within and outside OSU. We draw from many years of trials, errors, headaches, and effort working with drones to study cetacean ecology in a variety of habitats and conditions on many different species.

Already CODEX has developed several open-source hardware and software tools. We developed, produced, and published LidarBoX (Bierlich et al., 2023), which is a 3D printed enclosure for a LiDAR altimeter system that can be easily attached and swapped between commercially available drones (i.e., DJI Inspire, DJI Mavic, and DJI Phantom) (Figure 1). Having a LidarBoX installed helps researchers obtain altitude readings with greater accuracy, yielding morphological measurements with less uncertainty. Since we developed LidarBoX, we have received over 35 orders to build this unit for other labs in national and international universities.

Figure 1. A ‘LidarBoX’ attached to a DJI Inspire 2. The LidarBoX is a 3D printed enclosure containing a LiDAR altimeter to help obtain more accurate altitude readings.

Additionally, CODEX recently released MorphoMetriX version 2 (v2), an easy-to-use photogrammetry software that provides users with the flexibility to obtain custom morphological measurements of megafauna in imagery with no knowledge of any scripting language (Torres and Bierlich, 2020). CollatriX is a user-friendly software for collating multiple MorphoMetriX outputs into a single dataframe and linking important metadata to photogrammetric measurements, such as altitude measured with a LidarBoX (Bird and Bierlich, 2020). CollatriX also automatically calculates several body condition metrics based on measurements from MorphoMetriX v2. CollatriX v2 is currently in beta-testing and scheduled to be released late Spring 2024. 

Figure 2. An example of a Pygmy blue whale imported into MorphoMetriX v2, open-source photogrammetry software. 

CODEX also recently developed two automated tools to help speed up the laborious manual processing of drone videos for obtaining morphological measurements (Bierlich & Karki et al., in revision). DeteX is a graphical user interface (GUI) that uses a deep learning model for automated detection of cetaceans in drone-based videos. Researchers can input their drone-based videos and DeteX will output frames containing whales at the surface. Users can then select which frames they want to use for measuring individual whales and then input these selected frames into XtraX, which is a GUI that uses a deep learning model to automatically extract body length and body condition measurements of cetaceans (Figure 4). We found automated measurements from XtraX to be similar (within 5%) of manual measurements. Importantly, using DeteX and XtraX takes about 10% of the time it would take to manually process the same videos, demonstrating how these tools greatly speed up obtaining key morphological data while maintaining accuracy, which is critical for effectively monitoring population health.

Figure 3. An example of an automated body length (top) and body condition (bottom) measurement of a gray whale using XtraX (Bierlich & Karki et al., in revision).

CODEX is also in the process of developing Xcertainty, an R package that uses a Bayesian statistical model to quantify and incorporate uncertainty associated with measurements from different drones (see this blog). Xcertainty is based on the Bayesian statistical model developed by Bierlich et al., (2021b; 2021a), which has been utilized by many studies with several different drones to compare body condition and body morphology across individuals and populations  (Bierlich et al., 2022; Torres et al., 2022; Barlow et al., 2023). Rather than a single point-estimate of a length measurement for an individual, Xcertainty produces a distribution of length measurements for an individual so that the length of a whale can be described by the mean of this distribution, and its uncertainty as the the variance or an interval around the mean (Figure 4). These outputs ensure measurements are robust and comparable across different drones because they provide a measure of the uncertainty around each measurement. For instance, a measurement with more uncertainty will have a wider distribution. The uncertainty associated with each measurement can be incorporated into analyses, which is key when detecting important differences or changes in individuals or populations, such as changes in body condition (blog).

Figure 4. An example of a posterior predictive distribution for total length of an individual blue whale produced by the ‘Xcertainty’ R package. The black bars represent the uncertainty around the mean value (the black dot) – the longer black bars represent the 95% highest posterior density (HPD) interval, and the shorter black bars represent the 65% HPD interval. 

CODEX has integrated all these lessons learned, open-source tools, and analytical approaches into a single framework of suggested best practices to help researchers enhance the quality, speed, and accuracy of obtaining important morphological measurements to manage vulnerable populations. These tools and frameworks are designed to be accommodating and accessible to researchers on various budgets and to facilitate cross-lab collaborations. CODEX plans to host workshops to educate and train researchers using drones on how to apply these tools within this framework within their own research practices. Potential future directions for CODEX include developing a system for using drones to drop suction-cup tags on whales and to collect thermal imagery of whales for health assessments. Stay up to date with all the CODEX ‘X’travaganza here: https://mmi.oregonstate.edu/centers-excellence/codex.  

Huge shout out to Suzie Winquist for designing the artwork for CODEX!

References

Barlow, D.R., Bierlich, K.C., Oestreich, W.K., Chiang, G., Durban, J.W., Goldbogen, J.A., Johnston, D.W., Leslie, M.S., Moore, M.J., Ryan, J.P. and Torres, L.G., 2023. Shaped by Their Environment: Variation in Blue Whale Morphology across Three Productive Coastal Ecosystems. Integrative Organismal Biology, [online] 5(1). https://doi.org/10.1093/iob/obad039.

Bierlich, K., Karki, S., Bird, C.N., Fern, A. and Torres, L.G., n.d. Automated body length and condition measurements of whales from drone videos for rapid assessment of population health. Marine Mammal Science.

Bierlich, K.C., Hewitt, J., Bird, C.N., Schick, R.S., Friedlaender, A., Torres, L.G., Dale, J., Goldbogen, J., Read, A.J., Calambokidis, J. and Johnston, D.W., 2021a. Comparing Uncertainty Associated With 1-, 2-, and 3D Aerial Photogrammetry-Based Body Condition Measurements of Baleen Whales. Frontiers in Marine Science, 8. https://doi.org/10.3389/fmars.2021.749943.

Bierlich, K.C., Hewitt, J., Schick, R.S., Pallin, L., Dale, J., Friedlaender, A.S., Christiansen, F., Sprogis, K.R., Dawn, A.H., Bird, C.N., Larsen, G.D., Nichols, R., Shero, M.R., Goldbogen, J., Read, A.J. and Johnston, D.W., 2022. Seasonal gain in body condition of foraging humpback whales along the Western Antarctic Peninsula. Frontiers in Marine Science, 9(1036860), pp.1–16. https://doi.org/10.3389/fmars.2022.1036860.

Bierlich, K.C., Schick, R.S., Hewitt, J., Dale, J., Goldbogen, J.A., Friedlaender, A.S. and Johnston, D.W., 2021b. Bayesian approach for predicting photogrammetric uncertainty in morphometric measurements derived from drones. Marine Ecology Progress Series, 673, pp.193–210. https://doi.org/10.3354/meps13814.

Bird, C. and Bierlich, K.C., 2020. CollatriX: A GUI to collate MorphoMetriX outputs. Journal of Open Source Software, 5(51), pp.2323–2328. https://doi.org/10.21105/joss.02328.

Torres, L.G., Bird, C.N., Rodríguez-González, F., Christiansen, F., Bejder, L., Lemos, L., Urban R, J., Swartz, S., Willoughby, A., Hewitt, J. and Bierlich, K.C., 2022. Range-Wide Comparison of Gray Whale Body Condition Reveals Contrasting Sub-Population Health Characteristics and Vulnerability to Environmental Change. Frontiers in Marine Science, 9(April), pp.1–13. https://doi.org/10.3389/fmars.2022.867258.

Torres, W. and Bierlich, K.C., 2020. MorphoMetriX: a photogrammetric measurement GUI for morphometric analysis of megafauna. Journal of Open Source Software, 5(45), pp.1825–1826. https://doi.org/10.21105/joss.01825.