{"id":5019,"date":"2023-01-16T14:59:41","date_gmt":"2023-01-16T21:59:41","guid":{"rendered":"https:\/\/blogs.oregonstate.edu\/gemmlab\/?p=5019"},"modified":"2023-01-16T14:59:41","modified_gmt":"2023-01-16T21:59:41","slug":"keeping-it-simple-a-lesson-in-model-construction","status":"publish","type":"post","link":"https:\/\/blogs.oregonstate.edu\/gemmlab\/2023\/01\/16\/keeping-it-simple-a-lesson-in-model-construction\/","title":{"rendered":"Keeping it simple: A lesson in model construction"},"content":{"rendered":"\n<p><em>By: <a href=\"https:\/\/mmru.ubc.ca\/personnel\/kate-colson\/\">Kate Colson<\/a>, MSc Student, University of British Columbia, Institute for the Oceans and Fisheries, <a href=\"https:\/\/mmru.ubc.ca\/\">Marine Mammal Research Unit<\/a><\/em><\/p>\n\n\n\n<p>Models can be extremely useful tools to describe biological systems and answer ecological questions, but they are often tricky to construct. If I have learned anything in my statistics classes, it is the importance of resisting the urge to throw everything but the kitchen sink into a model. However, this is usually much easier said than done, and model construction takes a lot of practice. The principle of simplicity is currently at the forefront of my thesis work, as I try to embody the famous quote by Albert Einstein:<\/p>\n\n\n\n<p>&nbsp;\u201cEverything should be made as simple as possible, but no simpler.\u201d<\/p>\n\n\n\n<p>As you might remember from my earlier <a href=\"https:\/\/blogs.oregonstate.edu\/gemmlab\/2022\/09\/12\/putting-fitbits-on-whales-how-tag-data-allows-for-estimating-calories-burned-by-foraging-pcfg-gray-whales\/\">blog<\/a>, the goal of my thesis is to use biologging data to define different foraging behaviors of Pacific Coast Feeding Group (PCFG) gray whales, and then calculate the energetic cost of those behaviors. I am defining PCFG foraging behaviors at two scales: (1) dives that represent different behavior states (e.g., travelling vs foraging), and (2) roll events, which are periods during dives where the whale is rolled onto their side, that represent different foraging tactics (e.g., headstanding vs side-swimming).<\/p>\n\n\n\n<p>Initially, I was planning to use a clustering analysis to define these different foraging behaviors at both the dive and roll event scale, as this method has been used to successfully classify different foraging strategies for Galapagos sea lions (Schwarz et al., 2021). In short, this clustering analysis uses summary variables from events of interest to group events based on their similarity. These can be any metric that describes the event such as duration and depth, or body positioning variables like median pitch or roll. The output of the clustering analysis method results in groups of events that can each be used to define a different behavior.<\/p>\n\n\n\n<p>However, while this method works for defining the foraging tactics of PCFG gray whales, my discussions with other scientists have suggested that there is a better method available for defining foraging behavior at the dive scale: Hidden Markov Models (HMMs). HMMs are similar to the clustering method described above in that they use summary variables at discrete time scales to define behavior states, but HMMs take into account the bias inherent to time series data \u2013 events that occur closer together in time are more likely to be more similar. This bias of time can confound clustering analyses, making HMMs a better tool for classifying a series of dives into different behavior states.<\/p>\n\n\n\n<p>Like many analytical methods, the HMM framework was first proposed in a terrestrial system where it was used to classify the movement of translocated elk (Morales et al., 2004). The initial framework proposed using the step length, or the spatial distance between the animal\u2019s locations at the start of subsequent time intervals, and the corresponding turning angle, to isolate \u201cencamped\u201d from \u201cexploratory\u201d behaviors in each elk\u2019s movement path (Figure 1, from Morales et al., 2004). \u201cEncamped\u201d behaviors are those with short step lengths and high turning angles that show the individual is moving within a small area, and they can be associated with foraging behavior. On the other hand, \u201cexploratory\u201d behaviors are those with long step lengths and low turning angles that show the individual is moving in a relatively straight path and covering a lot of ground, which is likely associated with travelling behavior.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture1_Kate.png\"><img loading=\"lazy\" decoding=\"async\" width=\"774\" height=\"720\" src=\"https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture1_Kate.png\" alt=\"\" class=\"wp-image-5020\" srcset=\"https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture1_Kate.png 774w, https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture1_Kate-300x279.png 300w, https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture1_Kate-768x714.png 768w\" sizes=\"auto, (max-width: 774px) 100vw, 774px\" \/><\/a><figcaption class=\"wp-element-caption\"><em>Figure 1.<\/em> The difference between \u201cencamped\u201d and \u201cexploratory\u201d behavior states from a simple Hidden Markov Model (HMM) in a translocated elk equipped with a GPS collar (Fig. 1 in Morales et al., 2004). The top rose plots show the turning angles while the bottom histograms show the step lengths as a daily movement rate. The \u201cencamped\u201d state has short step lengths (low daily movement rate) and high turning angles while the \u201cexploratory\u201d state has long step lengths (high daily movement rate) and low turning angle. These behavior states from the HMM can then be interpollated to elk behavior, as the low daily movement and tight turns of the \u201cencamped\u201d behavior state likely indicates foraging while the high daily movement and direct path of the \u201cexploratory\u201d behavior state likely indicates traveling. Thus, it is important to keep the biological relevance of the study system in mind while constructing and interpreting the model.<\/figcaption><\/figure>\n<\/div>\n\n\n<p>In the two decades following this initial framework proposed by Morales et al. (2004), the use of HMMs in anlaysis has been greatly expanded. One example of this expansion has been the development of mutlivariate HMMs that include additional data streams to supplement the step length and turning angle classification of \u201cencamped\u201d vs \u201cexploratory\u201d states in order to define more behaviors in movement data. For instance, a multivariate HMM was used to determine the impact of acoustic disturbance on blue whales (DeRuiter et al., 2017). In addition to step length and turning angle, dive duration and maximum depth, the duration of time spent at the surface following the dive, the number of feeding lunges in the dive, and the variability of the compass direction the whale was facing during the dive were all used to classify behavior states of the whales. This not only allowed for more behavior states to be identified (three instead of two as determined in the elk model), but also the differences in behavior states between individual animals included in the study, and the differences in the occurrence of behavior states due to changes in environmental noise.<\/p>\n\n\n\n<p>The mutlivariate HMM used by DeRuiter et al. (2017) is a model I would ideally like to emulate with the biologging data from the PCFG gray whales. However, incorporating more variables invites more questions during the model construction process. For example, how many variables should be incorporated in the HMM? How should these variables be modeled? How many behavior states can be identified when including additional variables? These questions illustrate how easy it is to unnecessarily overcomplicate models and violate the principle of simiplicity toted by Albert Einstein, or to be overwhelmed by the complexity of these analytical tools.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture2_kate.png\"><img loading=\"lazy\" decoding=\"async\" width=\"872\" height=\"442\" src=\"https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture2_kate.png\" alt=\"\" class=\"wp-image-5021\" srcset=\"https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture2_kate.png 872w, https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture2_kate-300x152.png 300w, https:\/\/osu-wams-blogs-uploads.s3.amazonaws.com\/blogs.dir\/2115\/files\/2023\/01\/Picture2_kate-768x389.png 768w\" sizes=\"auto, (max-width: 872px) 100vw, 872px\" \/><\/a><figcaption class=\"wp-element-caption\"><em>Figure 2.<\/em> Example of expected output of Hidden Markov Model (HMM) for the PCFG gray whale biologging data (GEMM Lab; National Marine Fisheries Service (NMFS) permit no. 21678). The figure shows the movement track the whale swam during the deployment of the biologger, with each point representing the start of a dive. The axes show \u201cEasting\u201d and \u201cNorthing\u201d rather than map coordinates because this is the relative path the whale took rather than GPS coordinates of the whale\u2019s location. Each color represents a different behavior state\u2014blue has short step lengths and high turning angles (likely foraging), red has intermediate step lengths and turning angles (likely searching), and black has long step lengths and low turning angles (likely transiting). These results will be refined as I construct the multivariate HMM that will be used in my thesis. \u00a0<\/figcaption><\/figure>\n<\/div>\n\n\n<p>Luckily, I can draw on the support of <a href=\"https:\/\/mmi.oregonstate.edu\/gemm-lab\/granite-gray-whale-response-ambient-noise-informed-technology-ecology\">Gray whale Response to Ambient Noise Informed by Technology and Ecology<\/a> (GRANITE) project collaborators <a href=\"https:\/\/www.ursinus.edu\/live\/profiles\/5902-leslie-new\">Dr. Leslie New<\/a> and <a href=\"https:\/\/scholar.google.ca\/citations?user=n07X8FoAAAAJ&amp;hl=en&amp;oi=ao\">Dr. Enrico Pirotta<\/a> to guide my HMM model construction and assist in interpreting the outputs (Figure 2). With their help, I have been learning the importance of always asking if the change I am making to my model is biologically relevent to the PCFG gray whales, and if it will help give me more insight into the whales\u2019 behavior. Even though using complex tools, such as Hidden Markov Models, has a steep learning curve, I know that this approach is not only placing this data analysis at the cutting edge of the field, but helping me practice fundamental skills, like model construction, that will pay off down the line in my career.<\/p>\n\n\n<div class=\"emaillist\" id=\"es_form_f1-n1\"><form action=\"\/gemmlab\/wp-json\/wp\/v2\/posts\/5019#es_form_f1-n1\" method=\"post\" class=\"es_subscription_form es_shortcode_form  es_ajax_subscription_form\" id=\"es_subscription_form_69f48c7f2aea3\" data-source=\"ig-es\" data-form-id=\"1\"><div class=\"es-field-wrap\"><label>Name*<br \/><input type=\"text\" name=\"esfpx_name\" class=\"ig_es_form_field_name\" placeholder=\"\" value=\"\" required=\"required\" \/><\/label><\/div><div class=\"es-field-wrap ig-es-form-field\"><label class=\"es-field-label\">Email*<br \/><input class=\"es_required_field es_txt_email ig_es_form_field_email ig-es-form-input\" type=\"email\" name=\"esfpx_email\" value=\"\" placeholder=\"\" required=\"required\" \/><\/label><\/div><input type=\"hidden\" name=\"esfpx_lists[]\" value=\"e75fbcad40a2\" \/><input type=\"hidden\" name=\"esfpx_form_id\" value=\"1\" \/><input type=\"hidden\" name=\"es\" value=\"subscribe\" \/>\n\t\t\t<input type=\"hidden\" name=\"esfpx_es_form_identifier\" value=\"f1-n1\" \/>\n\t\t\t<input type=\"hidden\" name=\"esfpx_es_email_page\" value=\"5019\" \/>\n\t\t\t<input type=\"hidden\" name=\"esfpx_es_email_page_url\" value=\"https:\/\/blogs.oregonstate.edu\/gemmlab\/2023\/01\/16\/keeping-it-simple-a-lesson-in-model-construction\/\" \/>\n\t\t\t<input type=\"hidden\" name=\"esfpx_status\" value=\"Unconfirmed\" \/>\n\t\t\t<input type=\"hidden\" name=\"esfpx_es-subscribe\" id=\"es-subscribe-69f48c7f2aea3\" value=\"2433dd6234\" \/>\n\t\t\t<label style=\"position:absolute;top:-99999px;left:-99999px;z-index:-99;\" aria-hidden=\"true\"><span hidden>Please leave this field empty.<\/span><input type=\"email\" name=\"esfpx_es_hp_email\" class=\"es_required_field\" tabindex=\"-1\" autocomplete=\"-1\" value=\"\" \/><\/label><input type=\"submit\" name=\"submit\" class=\"es_subscription_form_submit es_submit_button es_textbox_button\" id=\"es_subscription_form_submit_69f48c7f2aea3\" value=\"Subscribe\" \/><span class=\"es_spinner_image\" id=\"spinner-image\"><img decoding=\"async\" src=\"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-content\/plugins\/email-subscribers\/lite\/public\/images\/spinner.gif\" alt=\"Loading\" \/><\/span><\/form><span class=\"es_subscription_message \" id=\"es_subscription_message_69f48c7f2aea3\" role=\"alert\" aria-live=\"assertive\"><\/span><\/div>\n\n\n\n<h1 class=\"wp-block-heading\">Sources<\/h1>\n\n\n\n<p>DeRuiter, S. L., Langrock, R., Skirbutas, T., Goldbogen, J. A., Calambokidis, J., Friedlaender, A. S., &amp; Southall, B. L. (2017). A multivariate mixed Hidden Markov Model for blue whale behaviour and responses to sound exposure. <em>Annals of Applied Statistics<\/em>, <em>11<\/em>(1), 362\u2013392. https:\/\/doi.org\/10.1214\/16-AOAS1008<\/p>\n\n\n\n<p>Morales, J. M., Haydon, D. T., Frair, J., Holsinger, K. E., &amp; Fryxell, J. M. (2004). Extracting more out of relocation data: Building movement models as mixtures of random walks. <em>Ecology<\/em>, <em>85<\/em>(9), 2436\u20132445. https:\/\/doi.org\/10.1890\/03-0269<\/p>\n\n\n\n<p>Schwarz, J. F. L., Mews, S., DeRango, E. J., Langrock, R., Piedrahita, P., P\u00e1ez-Rosas, D., &amp; Kr\u00fcger, O. (2021). Individuality counts: A new comprehensive approach to foraging strategies of a tropical marine predator. <em>Oecologia<\/em>, <em>195<\/em>(2), 313\u2013325. https:\/\/doi.org\/10.1007\/s00442-021-04850-w<\/p>\n","protected":false},"excerpt":{"rendered":"<p>By: Kate Colson, MSc Student, University of British Columbia, Institute for the Oceans and Fisheries, Marine Mammal Research Unit Models can be extremely useful tools to describe biological systems and answer ecological questions, but they are often tricky to construct. If I have learned anything in my statistics classes, it is the importance of resisting &hellip; <a href=\"https:\/\/blogs.oregonstate.edu\/gemmlab\/2023\/01\/16\/keeping-it-simple-a-lesson-in-model-construction\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Keeping it simple: A lesson in model construction<\/span><\/a><\/p>\n","protected":false},"author":10751,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[636310],"tags":[195384,677522,634945,1310780,148762],"class_list":["post-5019","post","type-post","status-publish","format-standard","hentry","category-gray-whale-foraging-ecology-and-vessel-disturbance","tag-field-work","tag-foraging-ecology","tag-gray-whales","tag-kate-colson","tag-oregon-coast"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"post_mailing_queue_ids":[],"_links":{"self":[{"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/posts\/5019","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/users\/10751"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/comments?post=5019"}],"version-history":[{"count":1,"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/posts\/5019\/revisions"}],"predecessor-version":[{"id":5022,"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/posts\/5019\/revisions\/5022"}],"wp:attachment":[{"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/media?parent=5019"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/categories?post=5019"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.oregonstate.edu\/gemmlab\/wp-json\/wp\/v2\/tags?post=5019"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}