GEOG 566

         Advanced spatial statistics and GIScience

Archive for My Spatial Problem

April 24, 2017

Japanese tsunami marine debris biota

Filed under: 2017,My Spatial Problem @ 12:49 pm


Research problem: Japanese Tsunami Marine Debris species

Six years ago, the devastating Tohoku earthquake and tsunami struck the coast of Japan. Since then, it has become evident that hundreds of coastal species from Japan have crossed the Pacific Ocean on tsunami debris, including species that have become invasive and been known to cause ecosystem/economic damage elsewhere. As of January 2017, scientists have documented the arrival of >650 debris items, referred to as Japanese Tsunami Marine Debris Biofouling Items (JTMD-BF) 1-651. Debris items include docks, buoys, boats, pallets, and wooden structures. These items were identified as JTMD if it 1) had clear identification such as a serial or registration number that was linked to an object lost during the tsunami of 2011; 2) had clear biological evidence of originating primarily from the Tohoku coast of Japan; or 3) a combination of these factors. The BF items are a subset of all the debris lost, as some debris items that were known to be lost from Japan remain undocumented – it is possible they ended up in remote locations.  A huge effort by taxonomists to identify the species on JTMD has generated a comprehensive species list. Currently, there are around 300 taxa that have been collected on JTMD from North American and Hawai`ian coastlines since 2012. I am interested in looking at the geographic distributions of the species with invasion history compared to those without invasion history.

Dataset: JTMD Database

My work is part of an international effort to evaluate the risks associated with JTMD and associated species.  I contributed to the development of a database of life history, distributional, and environmental attributes of many JTMD species. This database is being used for reference and analysis, and can aid in efforts to assess the risk associated with JTMD species, and in determining which species are of highest concern for establishment in certain areas. The database contains a little over 100 JTMD species, with 26 attributes per species. It has information for native geographic regions and non-native (if applicable) geographic regions. The regions are the Marine Ecoregions of the World. It also has survival temperature and salinity regimes, reproductive information, habitat, depth, trophic level, abundance data, and more. The database does not include temporal information, but these species arrived on debris items in pulses for years after the tsunami event, and that is another interesting aspect of the phenomenon.

Hypotheses: While the JTMD species are largely native to the Northwest Pacific (near Japan), they are also native to regions from all over the world. I hypothesize that most species without invasion history are native to the NW Pacific, and those with invasion history are native to regions all over the world. I expect to see this represented when their native regions are mapped. For the species that have been found outside their native range, I expect to see that some are only just outside native ranges in a few areas, but some with a history of invasion are more globally distributed.

Approaches: I have never used GIS, and would like to play around with ArcGIS to better learn the program. I would like to get a hold of the Marine Ecoregions of the World (MEOW) shape file in order to put it into ArcGIS, and overlay the JTMD geographic distributional history on top of those regions. The species data has been analyzed using PC-ORD, in trait space. And the species with invasion history were compared with those without, and no significant difference in life history traits were found.

Expected outcome: I want to produce maps of the JTMD geographic distributions. If I can work in statistical relationships as well with the data I have, that would also be beneficial to my understanding of the phenomenon.

Significance: Human-mediated transport of marine species across the globe through ballast water and hull fouling has been a concern for some time. JTMD is unique in comparison to these marine vectors, in that it can transport large numbers of marine species across ocean basins. Shipping routes are direct and arrive in known locations and at measurable frequencies whereas JTMD, which is propelled by winds and currents and travels at much slower speeds than ships, can arrive almost anywhere at any time. JTMD is potentially the transport vector with the most random distribution yet described. Due to the slow rates of transport by currents rather than propulsion, the effects of drag and dislodgement are reduced on JTMD in comparison with ship hull Furthermore, JTMD transports large numbers of adults, rather than larval stages that are more common in ballast water. As of January 2017, only one JTMD species, the striped beakfish Oplegnathus fasciatus, has been observed free-living in along the west coast of North America (in Oregon and Washington). At this time, we do not know if any of these JTMD species will become established outside of their current distributional range as a result of the earthquake and tsunami. But by studying the JTMD species invasion histories, and geographic distributions of both native and non-native areas, we can better understand this large dispersal event, and inform vector management response for future events.

Experience: I am a beginner in spatial analysis, and have no experience with Arc-Info, Modelbuilder or GIS programming in Python. I do have a little bit of experience in R, at least I used it when I took ST 511 and 512, but all the coding was handed to us, we just had to copy and paste it in to R. Being a beginner I will have to do a lot of self-exploration, tutorial watching, and asking for help from peers and teachers.


April 10, 2017

Assessment of Ecohydrologic Dynamics of the Mill Creek Municipal Watershed

Filed under: 2017,My Spatial Problem @ 7:14 pm


The overall objective of my M.S. Thesis research is to disentangle the potential effects of water diversions, forest management, and climate change on the hydrology of the Mill Creek watershed, a transboundary catchment straddling the Eastern Oregon and Washington border. Since 1989, 8,798 hectares of the headwaters of this catchment have co-managed by the U.S. Forest Service and City of Walla Walla “to provide water at a level of quality and quantity” for the City’s municipal supply [U.S. Forest Service, 1990]. Despite the management protections afforded this upper catchment, minimum instream flow recommendations by are rarely met at the most upstream monitored gauge for Mill Creek. At the base of the Municipal Watershed, a small impoundment dam has been constructed, where an intake structure generates power and carries water withdrawals through a pipeline 22 km to the municipal water treatment plant in Walla Walla. The municipal intake structure is roughly 7 km upstream of the first stream gauge, USGS 14013000, and although it was firs installed in 1913, there is no long-term record of the natural flow regime without municipal alteration.

Downstream of the municipal catchment, water rights have been historically overallocated beyond available stream discharge during periods of high demand. This is particularly a problem during the late summer months when peak water use by municipal and agricultural consumers coincides with patterns of seasonal low flow. Columbia Basin Bull Trout (Salvelinus confluentus) and Mid-Columbia Basin Steelhead (Onchorhyncus mykiss) are native to the Mill Creek Basin and have been listed as threatened under the Endangered Species Act since 1998, and 1999 respectively [Schaller et al., 2014]. Stakeholder interests are actively working to implement basin-scale flow planning to protect critical habitat for these species, and ensure flow volumes sufficient to ensure migratory passage to spawning and rearing grounds in the headwaters. To protect wildlife, fish, scenic, aesthetic, and other environmental and navigational values, minimum instream flow recommendations for Mill Creek were first established by Washington Administrative Code (WAC 173-532-030) in 1977, and revised in 2008.

An Instream flow” is broadly defined as a target flow rate, typically measured in cubic feet per second (cfs), at a specific gauging point for a defined period-of-time, often with seasonal variations [Geller, 2003] . Despite the management protections afforded this upper catchment, minimum instream flow recommendations are rarely met at the most upstream monitored gauge for the Mill Creek catchment (USGS14013000). For my master’s thesis, I am interested in teasing apart the relative role of anthropogenic development and natural ecohydrological trends which may be influencing stream discharge over time. I am interested in examining patterns in the long term hydrologic gauging record and satellite image record, to examine the empirical relationships between vegetation on streamflow discharge during summer months where minimum instream flow recommendations are rarely met.

Primary Spatial Questions to be addressed in GEOG566:

  • How has vegetation in the upslope accumulated area of the USGS Gauge 14013000 changed over the period of satellite record?
  • Is the observed seasonal change in “greenness”, as measured by NDVI, related to change in water discharge observed at the 14013000 stream gauge over the spring and summer season?
  • Does this relationship change with and without the addition of known withdrawals at the City of Walla Walla’s Intake Structure?
  • Secondary questions, that have a spatial component include: how is the management of the watershed restricting availability?


To do this we are using a simplistic conceptual model, based on the Water Balance Equation (Eq :1), to tease apart relationships of eco-hydrologic variables in the Mill Creek Drainage.

Eq 1:            Q = P – ET      →      ΔQ = ΔP – ΔET       →        ΔQ/ ΔP = ΔET

Where Q is discharge, P is Precipitation, and ET is Evapotranspiration. We are omitting groundwater and storage components from the initial analysis, although they may be included in future analysis

Description of datasets:

Q (discharge) at the 14013000 Kooskooskie Gauge on Mill Creek is available through the long-term USGS National Water Information System gauging record dating back to 1913. In addition, we have withdrawal data from the City of Walla Walla Public Utilities dating to 2000, to approximate an adjusted natural flow regime.

P (precipitation) data has been obtained from gauging stations at the Walla Walla Airport (1949- Present), and Milton Freewater (1928-Present), from the United States Historical Climatology Network, available online from Carbon Dioxide Information Analysis Center.

ET (evapotranspiration) has not been directly measured, however is an important flux in the water balance equation. Total system ET is a combined representation of the antmospheric flux of intercepted water, transpired water, soil water, and surface water.  Often there is a large difference between estimated Potential Evapotranspiration (PET), and the Actual Evapotranspiration (AET), which is a biophysical function relative to available of soil moisture, species physiology, and climatic variables. Water limited forest biomes often exhibit a water balance deficit, where potential summer atmospheric and plant demands exceed available soil moisture [Littell et al., 2010]. This ultimately will constrain photosynthesis, leading to senesce or entrance into dormant states where rates of ET decline. Generally, trees have higher rates of transpiration, as they have larger surface areas, are exposed to turbulent air flow, and root into deeper soil profiles with accessing moisture later into the season [Littell et al., 2010]. Based on these known physiological traits, I am interested in deriving the empirical relationship of ET to Multispectral Vegetative Indices as a proxy to estimate relative ET during the months of highest water stress.

The specific index utilized for this project is NDVI, the Normalized Differential Vegetative Index (NDVI) (Eq: 2), red (R) and near-infrared (NIR) surface reflectance detected by the satellite in question.

Eq 2:     NDVI= (NIR -R) / (NIR + R)

For Landsat ™ (4-7) imagery, this equation is achieved using the following bands.:

Eq 3:     NDVI = (Band 4 – Band 3) / (Band 4 + Band 3).

What does NDVI really-tell-us, and is it an appropriate measure to proxy for ET? NDVI is calculated Interpretation of NDVI value relies on the characteristics of healthy vegetation to absorb red light (for energy and photosynthesis) and reflect near-infrared light. NDVI values range from -1 to 1, are interpreted as an indicator of “Vegetative Greenness”. Greenness may change across at vegetative catchment can be linked to Eco-physiological factors of different vegetative communities and their phenology in relation to the water cycle. Pairwise dates were selected to best measure the difference in vegetation distribution following the post snowmelt green-up period, to the end of the dry season, where summer water deficit may impact ET rates.

Eq 4: NDVI Spring – NDVI Late Summer = Seasonal ΔNDVI

For the time change between each pairwise image, a corresponding change was also assessed for avaliable precipitation records and the the stream discharge record at the USGS gauge. Cumulative Discharge (cfs) was measured from Time 1 to Time 2, and converted to change in Mean Daily Discharge (by dividing the # of days in the period). This value was then normalized, by dividing measurements by the gauge drainage area of 154.36 sq km.

Cumulative Precipitation (mm) from Time 1 to Time 2 was assessed for each available set of pairwise images, and repeated or both the Milton Freewater and Walla Walla Precipitation Gauges.

ΔQ (mm/day)   –>  ΔP (mm/day)  –>   ΔQ/ΔP (mm/mm)

In my initial analysis, we looked at this empirical relationship across the entire drainage area of the 14013000 gauge. There was no clear statistical relationship in this early analysis, however, upon examination of the imagery, there appears to be clear topographic controls and perhaps vegetative patterns in seasonal pairs, and across years. For GEOG 566 I am interested in expanding these analyses to address smaller subsections of my catchment.


Intra-annual (seasonal) Change: HO: Areas with negative or low ΔNDVI values from seasonal change will include riparian forest lands and north facing slopes which will retain greenness or even increase in greenness over the course of the growing seasons, while areas with high ΔNDVI values will have encountered soil water deficits and have lower ET demand late in the Summer Season.

Inter-annual Change:  HO: Aforementioned patterns are consistent from one year to the next.

HA: There are expansions, or declines of certain regions of NDVI response, indicating potential shifts in species distribution.


HO: ΔNDVI statistics summarized by vegetation class polygons show clear spatial patterns

HA: ΔNDVI statistics summarized by vegetation class polygons show no clear spatial patterns

HO: Unsupervised classification of ΔNDVI shows similar patterns to the USFS vegetation class polygons

HA: Unsupervised classification of ΔNDVI do not spatially align with USFS vegetation class polygons

Expected outcomes:

I would like to continue refining my maps describing the spatial context for vegetation change, and establish whether there are any valid statistical relationships between vegetation/landscape change and the available hydrologic records.


The discipline of forest hydrology has a long history of studying the effects of vegetation in regulating stream-flow through evapotranspiration. In increasingly tightly regulated water environment, water resource managers are interested in the relative role of water use by trees and shrubs, and whether species distributions are changing over the catchment scale. At a regional scale, species distributions are often impacted by the ecohydrological balance between cool season surplus, and summer water deficit [Littell et al., 2010]. The management actions within the Municipal Catchment have preserved much of the natural character of the upper basin’s stream channel and riparian floodplain, thus retaining critical spawning and rearing habitat for several aquatic species of regional concern.The inability to meant instream flow recommendations at the Mill Creek Gauge USGS14013000 has, located high in the Mill Creek Basin before many of the largest water rights holders, raises questions about the future ability to manage the withdrawal structure for recommended instream flows under predicted future climate scenarios.  This information is of interest to resource managers from the City of Walla Walla Public Utilities, as well as collaborative basin planning organizations such as the Walla Walla Basin Watershed Council.

Your level of preparation:

Entering into this course, I feel adequately prepared to work within the ESRI suite of Arc-GIS software tools. I am concurrently enrolled in the GEOG562: GIS Programming for Geospatial Analyses to learn some basic Python Coding, and am working independently to refine techniques for utilizing open-source geospatial packages developed for use in R and R-Studio. I am hoping that this class will be an additional way I can gain practice with these tools, and become exposed to other relevant methodology implemented by my peers.


Littell, J. S., Oneil, E. E., McKenzie, D., Hicke, J. A., Lutz, J. A., Norheim, R. A., & Elsner, M. M. (2010). Forest ecosystems, disturbance, and climatic change in Washington State, USA. Climatic change, 102(1), 129-158.

Geller, L. D. (2003), A Guide to Instream Flow Setting in Washington State, Washington State Department of Ecology.

Littell, J. S., E. E. Oneil, D. McKenzie, J. A. Hicke, J. A. Lutz, R. A. Norheim, and M. M. Elsner (2010), Forest Ecosystems, Disturbance, and Climatic Change in Washington State, USA., Clim. Change, 102(1), 129–158.

Schaller, H. A. et al. (2014), Walla Walla River Bull Trout Ten Year Retrospective Analysis and Implications for Recovery Planning, US Fish Wildl. Serv. Columbia River Fish. Program Off. Vanc. WA.

U.S. Forest Service (1990), Umatilla Forest Land and Resource Management Plan, United States Department of Agriculture.

U.S. Forest Service (2014), Blue Mountains National Forests Proposed Revised Land Management Plan, U.S. Department of Agriculture.

April 7, 2017

Topographic constraints on climate drive range shifts in Great Basin small mammals

Filed under: My Spatial Problem @ 8:55 pm

A description of the research question that you are exploring.

In 2015, Elsen and Tingley published “Global mountain topography and the fate of montane species under climate change,” in the journal Nature – climate change. They combined two data sets, the global data set of mountain ranges from Natural Earth’s physical vectors and a high-resolution near-global DEM (SRTM30) to evaluate the assumption that available non-vertical surface area decreases with increasing elevation in mountain ranges. This is an important question in community ecology and conservation as it applies directly to a known relationship between species diversity and area established by the field of island biogeography and more widely applied in recent decades. Specifically, the relationship states that physical space increases, the number of species that space can hold increases. Elsen and Tingly addressed this question across 182 of the world’s largest mountain ranges. However, they did not apply their analytical methods to any of the mountains in the basin and range province, except for the Sierra Nevada Mountain range which make its western border and the Rocky Mountains on its eastern border. The basin and range in comprised of north-south trending mountain ranges and have been the focus of several studies addressing questions relevant to island biogeography. Additionally, modern (2009 – 2016) and historical (1929-1931) small mammal surveys have established small mammal community composition for a number of these mountain ranges. In order to better understand the dynamics underlying the distribution of small mammal species along elevation gradients in these ranges I would like to perform similar analysis to Elsen and Tingley on three ranges for which historical and modern small mammal communities have been studied, the Toiyabe mountain range, Ruby Mountain range, and the Snake range. Specifically, these analyses would address the question; to which hypsographic classification are species range shifts subject? I hope to better contextualize the range dynamics of species in response to climate vegetation changes. Furthermore, understanding how available area changes with elevation in these ranges will enable better predictions about a species ability to remain within its thermal envelope by moving up slope.


A description of the dataset you will be analyzing, including the spatial and temporal resolution and extent.

Predominantly, I plan to use the SRTM30 DEM in combination with land-sat data to evaluate area as a function of elevation. I am not sure which dataset will be best to use for vegetation analysis. I have spatially explicit data on the distribution of small mammal species along elevation. However, small mammal species data is not continuous along the elevation gradient, or within habitat/elevation bands.

The following description of the SRTM30 dataset has been copied from


SRTM30_PLUS V1.0 November 11, 2004

INTRODUCTION: This data consists of 33 files of global topography in the same format as the SRTM30 products distributed by the USGS EROS data center. The grid resolution is 30 seconds which is roughly one kilometer. Land data are based on the 1-km averages of topography derived from the USGS SRTM30 gridded DEM data product created with data from the NASA Shuttle Radar Topography Mission. GTOPO30 data are used for high latitudes where SRTM data are not available. Ocean data are based on the Smith and Sandwell global 2-minute grid between latitudes +/- 72 degrees. Higher resolution grids have been added from the LDEO Ridge Multibeam Synthesis Project and the NGDC Coastal Multibeam Data. Arctic bathymetry is from the International Bathymetric Chart of the Oceans (IBCAO) [Jakobsson et al., 2003]. All data are derived from public domain sources and these data are also in the public domain. The pixel-registered data are stored in 33 files with names corresponding to the upper left corner of the array shown below.

The USGS SRTM30 data and documentation is available at


Hypotheses: predict the kinds of patterns you expect to see in your data, and the processes that produce or respond to these patterns.

Elsen and Tingley produced four categories of hypsographic classification based on the relationship between area and elevation for mountain ranges. These categories are Diamond, Hourglass, Inverse pyramid, and pyramid, these categories describe mountains that increase then decrease in area with elevation, decrease then increase, increase, or decrease, respectively. They found that approximately 68% of the 182 ranges they analyzed were not categorized by the pyramid shape category. Importantly, the pyramid category describes “the dominant assumption in ecology and conservation that area decreases monotonically with elevation from a mountain range’s base (Tingley and Elsen 2015).” Despite this finding, the basin and range province is the result of 40 million years of geologic and geomorphic processes, which include continental rifting, erosion, mantle uplift, Pleistocene pluvial lake bank erosion, glaciation, and Holocene warming and drying. Specifically of these forces, I expect that the listric normal faulting that characterizes the Basin and Range to have resulted in pyramid shaped mountains. I also expect that ranges will be spatially auto-correlated, with respect to the skew of east versus west facing slopes.


Approaches: describe the kinds of analyses you ideally would like to undertake and learn about this term, using your data. 

I would like to learn how to overlay mountain range polygons atop a hi-resolution digital elevation model. I will need to learn how to use the R-package “Raster” to make the hypsographic curves described above and by Elsen and Tingley. I would also like to use Arc to make maps.


Expected outcome:

I would like elucidate the area-elevation relationship to which small mammal species, and all species are subject to in the mountains of the Great Basin. Ideally, I would also love to evaluate available area for habitat type based on the distribution of vegetation type and % cover along elevation. In this regard, I would like to produce maps of habitat types and graphs that illustrate the percent cover of a habitat type as a percent of total available space on the mountain. It would also be nice to show which microhabitat features are associated with climate/ weather measurement stations in place on these mountain ranges. In addition, I would be excited if I could make predictions about the constraints on a species ability to move upslope based on available area, this may be represented as a plot of species richness against a combined elevation-area score.



The Great Basin has a unique geological and physographic history in North America and has an experienced accelerated pace of human impact throughout the Anthropocene. The combination of human land-use practices and climate change has significantly altered small mammal community composition, pushing it outside of its range of natural variability. Understanding the physical constraints to a species ability to persist on the landscape is a critical to predicting how shifting community dynamics are constrained by the physical environment.


Your level of preparation:

I feel comfortable using R and exploring new packages and I am typically able to find the support I need when I am trying to understand or apply the tools available in a new R package.  I have limited experience with Arc-Info and/or Arc-GIS, however I do have knowledge of a number of the basic rules to follow when using these tools. I have extremely limited experience with Python.

Socio-Demographic and People’s Intentions Relationship in a Neighborhood System Adapting to Floods

Filed under: 2017,My Spatial Problem @ 1:58 pm

Research question.

For this problem I want to answer:

How socio-demographic and spatial variables explain patterns of survey responses about attitudes regarding flood-safety in neighborhoods?  

Description of the dataset.

The dataset for the study has the following characteristics:

  • Data is obtained from Household voluntary survey
  • Convenience sampling. Households located within the 100 and 500  FEMA’s Flood hazard Map. All  at once.
  • Each participant answers socio-demographic and predefined intentions’ questionnaire
  • A printed coded survey questionnaire was mailed out to residents. The code is used to identify resident addresses.
  • 103 variables have been collected
  • Most of the variables are categorical

      An example of a typical variable to be analyzed:

  • Variable: Suppose your current home was to flood, how confident are you in the following possible conditions? – I will be able to evacuate my home before flooding begins (This variable is expressed in 5 categories of discrete values without hierarchy):
  • Categories:
    • Very confident
    • Confident
    • Neutral
    • Somewhat confident
    • Not confident at all

The spatial data consists of a map identifying land properties within the boundaries of the 100 year and 500 year flood hazard Fema’s map has been developed, as shown in Figure 1. The survey has been mailed out to randomly selected properties withing this map boundaries.

Figure 1. Map for South Corvallis affected properties according to FEMA’s 100 year and 500 year flood categories.


Attitudes regarding flood-safety in neighborhoods are clustered according to socio-demographic and spatial factors.


Principal Component Analysis (PCA) and Factor Analysis (FA) is applied to analyze the collected data.

Figure 2. Principal component (

For Geographic pattern and cluster analysis I will test:

  • Average Nearest Neighbor
  • Spatial Autocorrelation (Moran’s I)
  • High/Low Clustering (Getis-Ord General G)
  • Cluster and Outlier Analysis (Anselin Local Moran’s I)
  • Hot Spot Analysis (Getis-Ord Gi*)

Expected outcome:

I would like to find statistical relationships between the categorical variables collected from the survey according to its spatial location that define patterns formation. And also, maps of these relationships within the 100 year and 500 Fema’s Map of Figure 1.   


This research will contribute to policy and decision making for neighborhood adaptation to climate change. Patterns identification of attitudes regarding flood-safety in neighborhoods is important for planning adaptation to different flooding scenarios in order to minimize personal risks.

Level of preparation:

a) Arc-Info: Intermediate

b) Model Builder and/or GIS programming in Python: Intermediate

c) R, or other relevant  spatial analysis software: Beginner-Intermediate

Spreading of Red Blotch disease in vineyards

Filed under: 2017,My Spatial Problem @ 1:37 pm


Spatial patterns in disease spreading in agriculture are important to understand. For example, what causes the infection and how to avoid further spreading. In this project, we focus on the Red Blotch virus in vineyards located in Oregon. Red Blotch affects the sugar content in the grapes, which changes the taste of the produced wine, and symptoms involve red blotches on the leaves. Using remote sensing we might be able to develop an early warning system and get a better spatial understanding on how this disease spreads. Nowadays, the virus is spotted when the red blotches appear, which is in general too late to remove the plant and avoid spreading. Another method is to apply an PCR, Polymerase Chain Reaction, test which can detect infection in the leaves by looking at its DNA or RNA. Both these methods are inefficient in terms of time, labor, and money. Using an Unmanned Aerial Vehicle, UAV, or better known as a drone, enables us to get a bird’s perspective on the vineyards with the flexibility of controlling the spatial resolution (flying higher or lower) and temporal resolution (fly whenever you want). This is the big advantageous of UAV’s over satellite based imagery.

The first goal in this research is to develop the early warning system, but closely related, and maybe even dependent on that, is the second goal of understanding the spatial distribution of the disease spread.

The research question for this class will be; Is there a spatial correlation in the spread of the Red Blotch disease in vineyards in study sites in Oregon?


Methods and Materials

We will be using UAV’s to acquire the aerial imagery. Two different sensors will be used, the multispectral and hyperspectral camera. These differ in the number of bands, respectively 5 and 270 bands. The hyperspectral bandwidths range between 400-1000nm. The advantageous of hyperspectral over multispectral is the precision of the bands. This gives us more in detailed spectral information on the vineyards. Hyperspectral is used for its ability to detect chlorophyll fluorescence, which is highly correlated to plant health.

For the multispectral we use the MicaSence Multispectral Camera, RedEdge. Which has a spatial resolution of 8 cm/pixel at an elevation of 400 ft. Depending on the elevation we can increase or decrease this resolution.

The hyperspectral sensor is the Nano-Hyperspec, Headwall Photonics. Depending on the above ground level the spatial resolution is about 5 cm.

Throughout the growing season, starting in May until October, we will be flying every month or twice a month over a couple of vineyards. The exact location is still to be determined.



I expect that there is a high spatial correlation in the spreading of the disease. The virus is transmitted by grafting, but also a vector can play a role as vineyards seem to infect neighbors.

For my main research goal, I expect that we will be able to develop an early detection method as some symptoms of the disease are preceded by physiological changes in the plants, which we will hopefully be able to detect.



In this class, I would like to learn more about the spatial correlation and how to deal with that. In previous research, I have dealt with comparisons between mean values from spatial data, assuming independence in measurements. However, when you are analyzing a problem that is spatially correlated you need to take that into account.


Expected outcome

For this class I will be producing a method to determine which vineyards are affected and which ones are at risk of being affected. Most likely, this will be in the form of map. For this risk map, we need to know the statistical relationship between infected and not infected plants, and how large the probability of infection is.



Red blotch can significantly decrease the sugar content in the grapes, which can change the taste of the wines. At the moment, farmers remove vineyard which are infected. The earlier we know which vineyards are infected, the earlier they can be removed, and new plants can be planted.


Level of Preparation

  1. Arc-info advanced
  2. Modelbuilder and/or GIS programming in Python novice, but taking GIS-science III, GIS programming in Python
  3. R advanced beginner, some experience from STATS 511 and 512
  4. Matlab advanced

Stream Geomorphic Change Detection and Analysis in an Oregon Coast Range Basin

Filed under: My Spatial Problem @ 1:07 pm


My master’s thesis involves quantifying geomorphic change in streams after the addition of a large wood jam for salmon habitat restoration purposes. Large wood has been shown to have many positive effects on salmon habitat, including creating pools and backwater areas that are essential for juvenile salmon survival. This past summer, I surveyed seven stream reaches within a single basin in the Oregon Coast range to capture the instream and floodplain topography before the addition of a large wood restoration structure. I plan to survey these sites again next summer to capture changes in topography caused by the interaction of wood, water, and sediment.

The overall goal of my thesis is to examine how upstream drainage area and bankfull width affect the amount of geomorphic change after large wood is added to a stream reach. Since I only have the first half of my data gathered at this point, I will utilize cross sectional data from adjacent LW sites gathered by a previous Masters student for my analysis, developing the methods that I will eventually employ with my own data once I gather the second round of survey data next summer.

The questions I am hoping to answer are:

  • What are the most effective methods for making an accurate raster interpolation from survey points?
  • What is the best way to quantify geomorphic change?
  • How does stream size affect the amount of geomorphic change caused by the addition of a LW jam?


To conduct my analysis, I will use topographic data that I gathered using a Total Station over two separate summers, 2015 and 2016. The 2015 survey represents the stream topography before the LW was added, while the 2016 represents the topography after the LW was added and has interacted with the stream bed over one winter high flow season.

These topographic data consist of XYZ values that make up stream cross sections. The cross sections were spaced one half bankfull width apart. Points on the cross section were taken at the vertical and horizontal inflection points. The points were collected at a relatively fine scale, with an average of one point collected per 0.25-1 meter. Points were collected at a higher density in areas with more variation in elevation, usually the stream channel while points were more spaced out on the floodplain surrounding the stream channel where the ground is usually flatter, with less change in elevation. These values were exported as a .csv to ArcGIS after the survey was completed.

The survey points were not georeferenced. Instead, a control network of known survey points was established to ensure repeatability for revisit surveys. The first benchmark point was designated as the datum and assigned an arbitrary coordinate of 10000 m, 10000 m, 1000 m. Then other benchmarks were surveyed in using the high precision Total Station setting.

Possible sources of error in the data can be linked to field data collection methods. Examples include: the Total Station not being set up exactly level and aligned over the benchmark points, the benchmark point moving slightly, or the survey rod not being held exactly perpendicular to the ground surface on the point that is being marked. All of these variables can introduce small errors into the resulting survey points. Much care was taken to eliminate as much uncertainty in the data as possible, but it is certain that there were at least a few centimeters of error incorporated throughout the survey.

Figure 1. Points gathered at the mainstem Mill Creek site in 2015 and 2016.


My research focuses on geomorphic change at two levels: the reach level and the basin level. There have been many studies that examine geomorphic change induced by large wood jams at the reach level. Based on these previous studies, I would expect the channel to become more heterogeneous in terms of elevation and width after the addition of a LW jam. I would also expect the stream to get wider due to more inundation of the floodplain during winter high flows. Pools will form and deepen upstream of the jam.

At the basin level, I would expect LW jam reaches in streams of intermediate size to experience the most geomorphic change. This is based on the balance between two driving factors: percent contact between the LW and the stream channel (greatest at small sites) and size and duration of overbank flows (greatest at large sites). Since the data gathered by the previous Masters student was not intended to compare differences in geomorphic change regulated by scale, it will be hard for me to make conclusions using this dataset but I can still attempt to compare the amount of change between the three stream reaches.


Last term in GIS II, I experimented with different interpolation methods in ArcGIS to determine which method would produce the most accurate raster, based on comparison of interpolated raster elevation values and known survey point elevations. I also incorporated TIN editing processes to create more accurate DEMs. Then I can use these rasters to compare the stream geomorphology before and after the addition of a LW jam.

In this class I want to continue to refine my interpolation methods and incorporate geomorphic change detection analyses and statistics, including accounting for interpolation error and, ideally, field data collection error as well. I have recently been reading some papers that present new ideas for minimizing interpolation error, including removing the trend in the data before interpolating and then adding the trend back in afterwards so I am interested in exploring this method.

Once I am satisfied with my DEMs, I will use ArcGIS to subtract the “before” and “after” DEMs to create a DEM of difference (DoD). Then I will use either ArcGIS or R to quantify the amount of geomorphic change (percent or raw) that occurred.


  • Figures quantifying geomorphic change between the two surveys in terms of stream elevation and width
  • Statistical relationships between amount of geomorphic change before and after the addition of a LW jam
  • Statistical relationships between amount of geomorphic change at sites of different size (based on upstream drainage area and bankfull width)
  • Error analyses to determine how much of the change is “real”, incorporating field data collection and interpolation error


Many engineered LW studies have been conducted that examine LW jam effects on stream geomorphic change and juvenile salmonid populations. There have been many reach-scale analyses and a few larger-scale studies that synthesize data from many watersheds but there has been little quantification of the geomorphic and biological responses to engineered large wood jams on the watershed scale within an individual basin. This kind of research can help refine stream restoration efforts and optimize resource allocation.


ArcGIS: Intermediate level (I spent last term working with it. I can perform most basic manipulations and figure out what I don’t know how to do from the help menus and the internet.)

Python: Beginner level (I have used it in two classes to perform basic calculations and plot data/results.)

R: Very limited experience (I used it a few times in my undergraduate Statistics class but that was 7 years ago so I will probably be very rusty.)

Spatial Relationships of Tsunami Prediction Data for Data Assimilation Use

Filed under: My Spatial Problem @ 1:06 pm

My Spatial Problem Blog Post

  1. A description of the research question that you are exploring.

The primary objective of my work in this course is to obtain spatial correlation structures in simulated tsunami prediction data. These correlation structures are needed to implement a data assimilation procedure which will be used for risk and uncertainty analysis. Data assimilation is essentially a process by which observations are incorporated into a model state of a numerical model [1]. To incorporate observations, specific information about the correlation structures in these data need to be determined. In particular, the probability distributions and covariance matrices of the datasets will have to be determined/calculated.

The “research questions” I have are therefore: “What are the autocorrelation structures for the simulated tsunami inundation data?”, “How does the predicted tsunami inundation relate to the topography?”, “How can I automate the process of determining these correlation structures to incorporate into data assimilation processes?”

2. A description of the dataset you will be analyzing, including the spatial and temporal resolution and extent.

I will be using nearshore tsunami simulation data that predicts sea surface elevation and fluid velocities for a stretch of coast in southern California near Port Hueneme. There are 27 possible data sets each with a different initial tsunami source. This simulation data is provided by the Method of Splitting Tsunami (MOST) model which is the standard model used at the NOAA Center for Tsunami Research (NCTR) [2]. Inundation data for various simulated tsunami events over a duration of 216000 sec (= 3600 min) and over a geographical area of (-119.2469462, 34.1384259) to (-119.1949874, 34.2000000) will be used. There are 3600 time steps for each set of data on a 562 by 665 grid. The spacing between the longitude and latitudes are uneven but can be projected onto an even grid. A time series of sea surface elevation and fluid velocities is available for each geographical point (see Figure 1).

Figure 1 – Simulated tsunami inundation data using MOST v4 at Port Hueneme, CA for a simulated Cascadia source. Image is taken from the Performance Based Tsunami Engineering (PBTE) data explorer.

In addition to the tsunami data, I also have the topographical/bathymetric data for the region (which does not change in time) with the same spatial resolution as the gridded data. This data is presented in Figure 2.

Figure 2 – Topological/bathymetric data for Port Hueneme, CA in geographic coordinates (left) and UTM projected coordinates (right).

3. Hypotheses: predict the kinds of patterns you expect to see in your data, and the processes that produce or respond to these patterns.

The patterns in the data should reflect the physical characteristics of the local topography. I expect to see correlation patterns that reflect wave refraction, wave diffraction, shadowing effects, regions of dissipation and wave focusing. Overall, any patterns observed from the analysis of the data should make physical sense given the coastal geometry of Port Hueneme. I also expected the correlation structures to facilitate the data assimilation process to analyze uncertainties, but that is outside the scope of this course.

4. Approaches: describe the kinds of analyses you ideally would like to undertake and learn about this term, using your data.

Without much experience with geospatial data analysis, I am generally pretty open minded to techniques that help to determine correlation structures of my data. Some minimal research into geospatial data analysis techniques suggests that tools such as variogram analysis/modelling and/or spatial autocorrelation techniques could be useful for what I want to get out of the data. Since I am unfamiliar with all of these techniques, I intend to learn about any of the techniques I choose to implement for analysis.

5. Expected outcome: what do you want to produce — maps? statistical relationships? other?

I expect to be able to produce statistical relationships, namely correlation structures, possibly variograms, and covariance matrices. Additionally, I hope to be able to visualize them in some form so as to confirm their validity and for quick comparison from one tsunami source to another.

Given the amount of data at my disposal, it would nice to develop an automated process for producing a geospatial analysis (relationships, maps, etc.) at any given time step of my data. This would be greatly beneficial to my work and could be automated into the data assimilation procedure in the future.

6. Significance. How is your spatial problem important to science? to resource managers?

Tsunami inundation data has been very difficult to measure in the field throughout history due the inherent dangers of having people or instrumentation in the inundation zone. This fact, combined with the relative infrequency of tsunami events, forces managers and disaster planners to rely on prediction data from tsunami models [3]. The problem with this, however, is that these models are deterministic and do not give a good idea of what the uncertainty of these predictions are. This can be problematic when relying on these predictions to perform risk analyses.

To give managers better tools to plan for tsunami disasters, this spatial problem seeks to aid in the methodology provide uncertainty estimates for these predictions. Assessment of uncertainties for these model predictions can allow for more comprehensive and effective risk management. For the scale of the problem in this course, the spatial problem may simply highlight specific coastal features that may be particularly prone dangerous inundation conditions during a tsunami disaster. Additionally, the spatial analysis could highlight some tendencies of the tsunami predictions model that may or may not be good.

From a scientific perspective, this spatial problem is a stepping stone for an application of data assimilation that has yet to be performed. Additionally, spatial analysis of the model output data could provide valuable insight to how different features of a coastline might relate to one another in a tsunami inundation situation.

7. Your level of preparation: how much experience do you have with (a) Arc-Info, (b) Modelbuilder and/or GIS programming in Python, (c) R, or (d) other relevant spatial analysis software

As of the start of this course I have no experience in either Arc, GIS, R, or Python. I have sufficient experience in MATLAB and FORTRAN. I’m hoping to learn some combination Arc, R and Python and use those tools to perform the spatial analysis for my problem.


[1] Data assimilation. European Centre for Medium-Range Weather Forecasts (ECMWF). Retrieved from

[2] Numerical modeling of tidal wave runup, VV Titov, CE Synolakis (1999). Journal of Waterway, Port, Coastal, and Ocean Engineering 124 (4), 157-171,

[3] Wegscheider, S., Post, J., Zosseder, K., Mück, M., Strunz, G., Riedlinger, T., Muhari, A., and Anwar, H. Z.: Generating tsunami risk knowledge at community level as a base for planning and implementation of risk reduction strategies, Nat. Hazards Earth Syst. Sci., 11, 249-258, doi:10.5194/nhess-11-249-2011, 2011.

Spatial Relationships of Vegetation in Restored and Remnant Salt Marshes, Salmon River Estuary, Oregon

Filed under: My Spatial Problem @ 12:56 pm

My Spatial Problem Blog Post


  1. A description of the research question that you are exploring.

Salmon River is one of the smallest estuaries on the northern Oregon coast (254 acres), with the largest proportion of tidal marsh (68%) for any Oregon estuary. It borders Tillamook and Lincoln counties, and is designated as an Important Bird Area by The National Audubon Society.  Conservation Research at Salmon River Estuary has been a focus of government, non-profit, and educational institutions since the 1970’s due to concern over salmonid habitat and the impacts of sea level rise on the coast. Salmon River consists of public and protected wetlands that have been restored and protected since the U.S. Forest Service removed dikes from three sites in 1978, 1987, and 1996. Tidal flow to the ocean is currently unobstructed on sites that were previously used as pastureland and/or diked. One wetland on the estuary was never diked and is used as a reference marsh for field research, to determine functional equivalency of restored marshes. Salmon River Estuary has been a site for place-based restoration and studies of community recovery over the last 35 years (Flitcroft et al. 2016).

Factors influencing the richness and environmental integrity of Salmon River are associated with the physiognomic and taxonomic features of the plant community. This study focuses on the spatio-temporal distribution patterns of Salmon River vegetation to explore how remnant and restored marshes differ in terms of biodiversity and species composition. I expect that different durations of tidal exclusion through dike establishment will reveal differences between sites in the context of plant species composition and distribution. The biological research questions are as follows: What are the differences between remnant and restored marshes in the context of ecological gradients (tideland to upland)? How can those differences be explored with vegetation and soil data? How do these differences reflect land use history in this ecosystem? The methodological questions of interest for this research include: How do field methods, specifically nested, rectangular and non-nested, square (transect) plots capture spatial variation of vegetation differently? Do these field methods (transect versus nested plot sampling) differ in capturing species diversity? Ultimately, I would like to propose answers to who lives with whom, and why, for this specific estuary.


  1. A description of the dataset you will be analyzing, including the spatial and temporal resolution and extent.

I have collected species data (ocular estimations of percent coverage) from 1 m2 plots on transects from four tidal marshes: Mitchell Marsh (dike removed 1978), Y Marsh (dike removed in 1987), Salmon Creek Marsh (dike removed 1996), and one remnant marsh adjacent to Y marsh as a control (never diked). I also collected soil samples at each sampled transect plot, and tested them for salinity, conductivity, and bulk density, as well as nitrogen content. Transect plots were square shaped plots, 50 m apart in increasing distance from the tide.  My objective for data analysis is to describe the spatial patterns of the vegetation communities in tidal marshes of Salmon River Estuary after dike removal.

Stohlgren plots, also known modified Whittaker plots (MW), were established at each marsh site to collect data on species abundance for comparison with transect data. MW plots were implemented to test for patterns of diversity at multiple scales beyond what transect, square meter plots may capture within the same site. The restored and remnant sites have three MW plots each. The MW, plots were placed at a random distance 50 m along and 20 m offset from the sampled transects at each marsh for a stratified random sample design. At each MW plot, percent cover and presence/absence of species were estimated (with the aid of 1 meter square grids). Samples of all species identified were collected, pressed and are being examined to confirm identification. Elevation data were obtained from LiDAR surveys in 2015, and used to pinpoint elevation for all plots.

The datasets for my project include spreadsheets that describe percent vegetation cover, elevation, and soil characteristics per transect plot. MW plots were not sampled for soil and thus only have percent vegetation cover and elevation. The spatial grain of my study is one square meter, represented by the size of my smallest sampling frame for plots. With the nested sample plots and my stratified random sampling techniques, I have multiple spatial extents for this project. One extent could be considered the length of a transect (which vary by tidal marsh), the area of a MW plot (1,000 m2), or could arguably be extended to the entire Estuary. There are some interesting temporal aspects to my study as well; three of the four tidal marshes have experienced successive dike removal. These marshes have been surveyed for vegetation cover post dike removal and every 5 years subsequently. Incorporating these historical data will add dimension to my spatial and temporal analysis of variation at my study site.

Hypotheses: predict the kinds of patterns you expect to see in your data, and the processes that produce or respond to these patterns.

I expect that plots from the reference marsh (Transect C, adjacent to Y Marsh) will be more diverse and heterogenous than tidal marshes that have been diked. I predict sites that have experienced dike removal more recently will have different species composition and be less diverse compared to reference sites. I also predict that time since dike removal will increase similarities between reference and restored sites. I anticipate that sites which have experience recent dike removal have soils with higher bulk density, higher nitrogen content, and higher salinity and conductivity readings, compared to remnant marsh plots. These differences are likely due to successive dike removal and differences between tidal marsh sites.

  1. Approaches: describe the kinds of analyses you ideally would like to undertake and learn about this term, using your data.

I would like to explore more statistical analyses with additional data related to my site. I have accessed historical data from this site related to vegetation cover on established transect plots. I want to compare the vegetation coverage I surveyed in July 2016, to coverage of decades prior with ordinations (Non-metric Multidimensional Scaling), MRPP (Multi-Response Permutation Procedures), and ISA (Indicator Species Analysis). Most of this will be achievable with PC-Ord, though there are likely opportunities with Python/ArcInfo and R as well. I also want to explore vegetation cover data in species area relationships, with species accumulation curves. There are many software options with which I can do this as well.

  1. Expected outcome: what do you want to produce — maps? statistical relationships? other?

I plan to focus primarily on producing and examining statistical relationships with my research. My project is a highly-localized case study, with a temporal component. With statistical software, I will be able to produce graphics that will aid in visual interpretation of patterns as well, in conjunction with statistics. From that point I would like to connect my statistical analyses with biological interpretations to ground my work in context and understand how the Estuary is changing over time, post dike removal.

  1. How is your spatial problem important to science? to resource managers?

The spatial problem I have chosen to investigate is of importance to scientists, as it provides further insight into how Pacific Northwest Coastal Estuaries recover from land use and disturbance, a phenomenon that has not been thoroughly studied yet. Estuaries have historically served as habitat and resources for keystone salmonid fish species, invertebrates, migratory birds, waterfowl, and mammals (such as beavers), particularly in the Pacific Northwest (Klemas 2013). Restoring these habitats is critical for protecting wildlife, managing wetland resources and eco-services, and maintaining our shorelines, especially as we face sea level rise from impending climate change. Threats to environmental stability in the case of wetlands can also harm their cultural value. Wetlands have inherent eco-beauty and are among the many natural systems associated with outdoor recreation. If wetlands are disturbed via pollution, compaction, or compositional change in vegetation, little is understood about the certainty of recovery in the context of reference conditions.

Factors influencing the richness and environmental integrity of estuaries like Salmon River are associated with the physiognomic and taxonomic features of the plant community. This study focuses on the spatio-temporal distribution patterns of Salmon River vegetation to explore how remnant and restored marshes differ in terms of biodiversity and species composition. Salmon River Estuary is especially noteworthy due to its unique management history. Salmon River is exceptional compared to other Oregon coastal wetlands, as it was federally protected before any industries could establish influence. Arguably, Salmon River has avoided most disturbance from development because of its relatively small size; there have been no instances of dredging or jetty construction for the purposes of navigation. Previous use as pastureland with dike establishment in the 1960’s is the dikes established in the 1960s and removed from 1978 onwards encapsulates the majority of known human influence on the marsh. Beginning in 1978, periodic vegetation surveys on site with long term ecological research at Salmon River has created intimate knowledge of the estuary and promoted ecological sustainability.

There has been a dramatic shift with regards to wetland protection and how our government and the public views them. Over the last few decades, policies promoting wetland conversion and development have been exchanged for protection and regulation initiatives. Wetland management goals today are largely focused on restoration to compensate for loss and damage, which has forged new industries tasked with wetland recovery and monitoring. However, some mitigation project datasets and sites are too small to collect useful data or make a meaningful impact on a large environmental scale. It is necessary to amass a variety of high quality data on larger wetland areas over longer periods of time to address how natural recovery processes may be employed for wetland conservation. Salmon River is an excellent long term case study for examining the prospect of rehabilitation for ecosystem functionality and reference conditions (Frenkel and Moran 1991; Frenkel 1995; Flitcroft et al. 2016).

  1. Your level of preparation: how much experience do you have with (a) Arc-Info, (b) Modelbuilder and/or GIS programming in Python, (c) R, or (d) other relevant spatial analysis software

I have previous experience with Arc that I have developed and expanded upon at Oregon State while pursuing my MS in Geography. I have taken Dr. Robert Kennedy’s Python Programming for GIS course, which introduced me to Python and provided additional experience with Modelbuilder. I have limited experience with R (I have only used it a handful of times). Ultimately, I have analyzed most if not all of my data thus far in PC-ORD, under the guidance of Dr. Bruce McCune (the software creator), which enables a variety of ordination and multivariate analysis options.

Literature Cited

Adamus, P.R., J. Larsen, and R. Scranton. 2005. Wetland Profiles of Oregon’s Coastal

Watersheds and Estuaries. Part 3 of a Hydrogeomorphic Guidebook. Report to

Coos Watershed Association, US Environmental Protection Agency, and Oregon

Depart. of State Lands, Salem.

Borde, AM, Thome, RM, Rumrill S, Miller, LM. (2003). Geospatial Habitat Change

Analysis in Pacific Northwest Coastal Estuaries. Estuaries, 26, 1104-1116.

Flitcroft, RL, Bottom, DL, Haberman, KL, Bierley, KF, Jones, KK, Simenstad, CA, Gray, A,

Ellingson, KS, Baumgartner, E, Cornwell, TJ and Campbell, LA. 2016. Expect the

unexpected: Place-Based Protections can lead to Unforeseen benefits. Aquatic

Conservation: Marine and Freshwater Ecosystems (26): 39-59.

Mather, P.M. 1976. Computational methods of multivariate analysis in physical

geography. J. Wiley & Sons, London. 532 pp.

McCune, B. and J. B. Grace. 2002. Analysis of Ecological Communities. MjM Software,

Gleneden Beach, Oregon, USA (

National Soil Survey Center, Natural Resources Conservation Service, U.S. Department of

Agriculture (2009), Soil Survey Field and Laboratory Methods Manual. Soil Survey

Investigations Report No. 51.

Stohlgren, TJ, Falkner, MB, and LD Schell. 1995. A Modified-Whittaker nested vegetation

sampling method. Vegetatio 117: 113-121.

Weilhoefer, C, Nelson, WG, Clinton, P, Beugli, DM. (2013). Environmental

Determinants of emergent macrophyte vegetation in Pacific Northwest estuarine

tidal wetlands. Estuaries and Coasts, 36, 377-389


Habitat use of blue whales New Zealand’s industrial South Taranaki Bight region

Filed under: My Spatial Problem @ 12:44 pm

Research objectives

My research focuses on the ecology of blue whales (Balaenoptera musculus brevicauda) in New Zealand’s South Taranaki Bight region (STB). Despite the recent documentation of a foraging ground in the STB (Torres et al. 2015), blue whale distribution remains poorly understood in the southern hemisphere. The STB is New Zealand’s most industrially active marine region, and the site of active oil and gas extraction and exploration, busy shipping traffic, and proposed seabed mining (Torres 2013). This potential space-use conflict between endangered whales and industry warrants further investigation into the spatial and temporal extent of blue whale habitat in the region. My goals are to investigate the relationship between blue whale presence and their environment, and subsequently to examine how their space-use overlaps with industry presence. Specifically, I intend to:

  • Quantify the relationship between sea-surface temperature (SST), chlorophyll-a (chl-a), krill density, and blue whale presence
  • Investigate the spatial overlap between blue whale presence and oil and gas extraction platforms, the Trans-Tasman Resources Ltd. proposed seabed mining site, and shipping traffic

Map of New Zealand with the South Taranaki Region indicated by the white box.


A blue whale surfaces in front of an oil rig in the South Taranaki Bight. Photo by Kristin Hodge.


I will be working with data collected during vessel-based surveys in February of 2014, 2016, and 2017 in the STB. Blue whale sighting location and group size were recorded by observers during the surveys. To record the oceanographic conditions throughout the water column, profiles of water column depth, temperature, and salinity were recorded using a Sea-Bird microCAT (SBE 911plus) Conductivity, Temperature and Depth (CTD) sensor approximately every hour during survey and at every blue whale sighting. Krill density and patch size will be quantified from hydroacoustic backscatter data collected with a Simrad EK60 echosounder (Simrad ES120-7DD splitbeam transducer, 120kHz transceiver, 250 W, 1.024 ms pulse length, 0.5 s ping rate). The echosounder data have not yet been processed, however I hope to do so this term so that the prey data can be included in these analyses.

In addition to the in situ data collected during our surveys, I plan to incorporate satellite imagery of SST and chl-a concentration for the region. I will use satellite data generated from NASA Moderate Resolution Imaging Spectrometer (MODIS).

I have been provided with the locations of the oil and gas drilling platforms and the proposed site for the iron sands seabed mine. I will use ship automatic identification system (AIS) data for the shipping traffic layer.


  • Blue whale presence will show a positive relationship with chl-a concentration and krill density
  • Blue whale presence will show a negative relationship with SST
  • There will be apparent inter-annual differences in blue whale sighting distribution, reflecting the strong El Nino conditions seen in 2016
  • Blue whale presence will overlap spatially with industrial activities


I plan to use ArcMap to visualize blue whale presence and the layers of oceanographic data I have described previously (in situ SST and prey density, remote-sensed SST and chlorophyll-a). I will then use R to compute either a generalized linear model (GLM) or generalized additive model (GAM) to evaluate the association between blue whales and these oceanographic variables, with blue whale presence as the response variable.

For the overlap between blue whale presence and industry, I intend to use ArcMap to visualize this spatial overlap by creating buffers around the stationary platforms and the proposed mining site and examining how often blue whale sightings took place within those buffers.


I intend to produce map figures that will show oceanographic measurements interpolated over our study area, overlaid with our blue whale sighting locations for each year of study. I hope to be able to report the model results that quantify the impact of our measured oceanographic conditions on blue whale presence.

My priority for this course is the habitat analysis and modeling. The industry overlap will be more of a visual examination at this stage, and more quantitative analyses of impacts will take place subsequently once a foundational understanding of blue whale habitat use in the region has been established.


Despite their enormous size and once-large global population, relatively little is known about blue whale distribution and habitat use due to their elusive nature and relative inaccessibility for study. Blue whales have extremely high metabolic demands in addition to employing an energetically expensive lunge-feeding foraging strategy (Croll et al. 1998, Goldbogen et al. 2011, Hazen et al. 2015). The ability to consistently locate dense patches of prey is therefore critical to blue whale survival, making the documentation of foraging grounds such as the STB region important. The STB region presents a unique opportunity for studying this species in a location where they seem to be consistently found in high abundance and relatively close to shore. But beyond their relative accessibility, the understanding of blue whale habitat use on a foraging ground will contribute to the body of knowledge on these endangered and little-studied whales.

Under the New Zealand threat classification system, blue whales are currently listed as ‘Migrant’ in New Zealand waters. However, our preliminary analyses point toward the possibility of a resident population of blue whales in New Zealand. Observations of foraging, breeding, and nursing behaviors demonstrate the likelihood of the STB form multiple critical life history functions. The strong industrial presence in the region and the ongoing push for industry expansion makes gaining an understanding of the spatial and temporal extent of blue whale habitat critical for management decisions.

A blue whale mom and calf surface in the South Taranaki Bight. Photo by Dawn Barlow.


I have some experience with Arc and R from coursework and my own preliminary analyses. I think that with more time and more practice I will grow more confident in my ability to use them. I have used MATLAB some, but mostly as a platform for running more specialized software programs specific to my field. I have no experience in python.


Croll DA, Tershy BR, Hewitt RP, Demer DA, Fiedler PC, Smith SE, Armstrong W, Popp JM, Kiekhefer T, Lopez VR, Urban J, Gendron D (1998) An integrated approach to the foraging ecology of marine birds and mammals. Deep Res II 45:1353–+

Goldbogen JA, Calambokidis J, Oleson E, Potvin J, Pyenson ND, Schorr G, Shadwick RE (2011) Mechanics, hydrodynamics and energetics of blue whale lunge feeding: efficiency dependence on krill density. J Exp Biol 214:131–146

Hazen EL, Friedlaender AS, Goldbogen JA (2015) Blue whales (Balaenoptera musculus) optimize foraging efficiency by balancing oxygen use and energy gain as a function of prey density. Sci Adv 1:e1500469–e1500469

Torres LG (2013) Evidence for an unrecognised blue whale foraging ground in New Zealand. New Zeal J Mar Freshw Res 47:235–248

Torres LG, Gill PC, Graham B, Steel D, Hamner RM, Baker S, Constantine R, Escobar-Flores P, Sutton P, Bury S, Bott N, Pinkerton M (2015) Population, habitat and prey characteristics of blue whales foraging in the South Taranaki Bight, New Zealand.

Nitrate Abundance in 10 Oregon Watersheds as linked to percentage of Red Alder Forest Cover

Filed under: My Spatial Problem @ 11:31 am

Research Description

I am exploring aquatic chemistry among 10 streams in the Oregon Coast range. I am running anion analysis over the period of 1 year, sampling monthly. These streams are divided into three categories, three are located East of Mary’s Peak in the Rock Creek Watershed, three near the top of Mary’s peak (highest in elevation), and four streams are sampled on the Oregon Coast within 200 meters of the Pacific Ocean. I would like to perform a comparison analysis of anions between the three categories, and find if there is any significant difference in anion abundance between the three locations and over time (6 months of data collected so far).

Dataset Analysis

The dataset I will be analyzing includes temporal anion data (for 6 months, project still in progress) and GPS points for all 10 site locations. The anions analyzed include chloride, sulfate, fluoride, and nitrate. Nitrate may be the most interesting as far as ecological impact is concerned (known to be limited in aquatic ecosystems), but chloride has also been an abundant anion while looking at the data.


I hypothesize that due to the influence of Red Alder in the forested watersheds of the Oregon Coast range, the Oregon Coast and Rock Creek watershed streams will have a higher amount of measurable nitrate than those streams sampled near the top of the watershed (near Mary’s Peak). I also predict that Nitrate would be higher in the Fall (November and December) than in the spring (March and April) due to the “flushing” of nutrients from the forest floor after summer low flows.

Analysis Approach

I would like to perform statistical analysis on these streams as they relate to nitrate using R and map the differences using Arc GIS. Preferably I would like to delineate the 10 watersheds and then do a comparison analysis of the percentage of Red Alder in each watershed, and the amount of nitrate data collected in the stream.

Expected Outcome

I would like to produce a map showing nitrate export in all 10 study sites as well as statistical evidence of the different categorical zones. This map will include the 10 delineated watersheds, one watershed is “nested”, therefore the output will likely include 9 watersheds and rank the watersheds based on red alder abundance and nitrate concentration. I’d also like to explore a statistical analysis comparing the three categorical locations and nitrate concentrations.


Baseline data collection for anions in Oregon Coast Range streams over a year has not been done before. In the face of a changing climate, it is important to document and characterize stream chemistry for forest managers in the future. Increases in nitrates in these streams can affect local flora, fauna as well as increases nutrient loading to the ocean. This loading can affect macroinvertebrate and salmon habitat which can alter ecosystem dynamics and services that currently exist.

Level of preparation

I have basic knowledge of R, but I think it would the most useful tool for statistical analysis of the data. Arc-info is a program I at one time used daily but it has been many years since. I would like to be able to map the anion data between the 10 sites but may struggle in the beginning linking the data with Arc. I have no experience with Python or other relevant spatial analysis software.

Patch dynamics for short-interval disturbances of beetle outbreak and wildfire

Filed under: My Spatial Problem @ 11:05 am

Research Question

What controls high-severity patch size during wildfire when landscape have large portions of dead forest from a prior disturbance of beetle outbreak?


Fire is a complex landscape process that is controlled by the heterogeneity of vegetation/fuels, topography, and weather.  The landscapes created by fire may dictate forest structure and function for decades. In central interior British Columbia, lodgepole pine (Pinus contorta var. latifolia) forests have been the epicenter of a recent regional outbreak of mountain pine beetle (Dendroctonous ponderosae), which has resulted in widespread tree mortality and potentially increased forest vulnerability to wildfire. While lodgepole pine landscapes are accustomed to large-scale singular disturbances of beetle outbreaks and wildfire, beetle outbreaks followed by wildfires are less familiar. Warm winters and an abundance of mature lodgepole pine have facilitated the unprecedented scale of beetle activity in the region. The resulting tree mortality alters vegetation/fuel structure, fire behavior, and subsequent fire severity. It is unclear if the resulting landscape structure from beetle outbreak influences the subsequent patchwork from fire.  A number of regional studies have examined the controls of post-fire patch size under a single disturbance of wildfire (Collins and Stephens 2010, Harvey et al. 2016, Reilly et al. 2017).  An important knowledge gap I have identified is the role of underlying disturbance structure on patch dynamics during wildfire events.

In this course, my primary object is to determine the controls of patch size for high severity patches under conditions of consecutive landscape scale disturbances of beetle outbreak and fire.  In order to do this, there are a number of intermediary steps/questions to ask and answer:

  1. What are the patch sizes following beetle outbreaks?
  2. What are the patch sizes following wildfire?
  3. What patches are high-severity fire?
  4. What are the topographic roughness characteristics associated with each high severity patch?
  5. What is the radiative output (proxy for burning conditions) for each high-severity patch during day of burn?
  6. What is the dominant vegetation type and moisture level associated with each high severity patch prior to disturbance?
  7. What is the relationship between each explanatory variable – topography, burning conditions, vegetation, and beetle outbreak patch size, to the response variable –high-severity patch size?


Figure 1. Post-fire photo of a fire that burned through beetle-killed forest in 2014 with various patches of fire severity.

 Description of datasets

  1. The spatial extent of all layers are defined by the fire perimeters for three different fires that burned in 2012, 2013, and 2014.
  2. Tree mortality from the bark beetle outbreak has been identified by calculating a Normalized Difference Vegetation Index (NDVI) layer derived from Landsat 30m resolution from one year prior to each fire event.
  3. Fire severity patches will serve as the response variable. Fire severity patches were generated from calculating the Relative differenced Normalized Burn Ratio (RdNBR) from Landsat 30m resolution (Miller and Thode 2007). RdNBR is calculated by generating an NBR image for both pre- and post-fire, then calculating the dNBR, which is then used to calculate the RdNBR. The purpose of RdNBR is to minimize the bias of the pre-fire conditions.  This relativization process allows for comparison across fire events with minimal biasing from differences in pre-fire conditions.
  4. The pre-disturbance landscape structure including vegetation and moisture types will be based on a polygon shape file with the biogeoclimatic zones for the region. Biogeoclimatic zones are a specific classification system for British Columbia, Canada. Zones are based on climax vegetation while accounting for climatic and edaphic conditions (Meidinger and Pojar 1991).
  5. The gently rolling landscape of the central interior offers little topographic relief. While topography may not be a strong control over patch size, we will account for topographic variability with a layer for topographic roughness. Topographic roughness was calculated from a 3-by-3 moving window using a 25 m Digital Elevation Model (DEM).
  6. Weather is considered a strong control for fire severity and patch size. Day of burn weather conditions are based on the thermal band hotspot data from MODIS. The hotspot data is a point shape file that includes a recording of radiative energy for specific point locations.


My ecological hypothesis is that burning conditions (i.e. weather during day of burn) exert strong control over patch size.  Fire conducive weather, dry and windy conditions, control large patch development by facilitating crown fire in lodgepole pine dominated landscapes.  Beetle outbreaks change the landscape structure by killing trees, which results in decreased continuity of canopy fuels. The lack of canopy fuels may constrain fire spread to the surface and thus limiting patch size. However, fire weather may still override the changes in fuel arrangement and continuity created by beetle outbreak, where dry windy conditions produce large patches regardless of underlying patch characteristics from the beetle outbreak.


  1. Calculate patch variables for tree mortality from the beetle outbreak using Fragstats (McGarigal et al. 2012). Characterize patch size, complexity, and configuration.
  2. Calculate patch variables for high-severity patches using Fragstats (McGarigal et al. 2012). Characterize patch size, complexity, and configuration.
  3. Overlay layers and extract explanatory variables (topographic roughness, radiative energy, beetle outbreak patch size) that correspond to individual patches in ArcMap.
  4. Conduct multiple regression analysis to determine which variables influence high severity patch size using R statistical software.

Expected Outcomes

I would like to produce maps showing characteristics of the landscape. I would also like to have analysis showing if there is a relationship between any of these variables and patch size.


Overlapping disturbances of beetle-outbreak and wildfire will influence landscape structure and function for decades in central interior British Columbia, Canada.  It is important to characterize patch dynamics to generate ecological context for overlapping disturbances. This research can provide insight on patterns and controls for fires that were allowed to burn unhindered by suppression, which is important for understanding contemporary disturbance regimes and providing guidance for management decisions.

Level of Preparation

  • Arc: fairly proficient
  • Model builder and Python: have used both, but probably a bit rusty; novice
  • R: growing knowledge; able to conduct statistical analysis and process Landsat imagery


Collins, B. M., and S. L. Stephens. 2010. Stand-replacing patches within a “mixed severity” fire regime: Quantitative characterization using recent fires in a long-established natural fire area. Landscape Ecology 25:927–939.

Harvey, B. J., D. C. Donato, and M. G. Turner. 2016. Drivers and trends in landscape patterns of stand-replacing fire in forests of the US Northern Rocky Mountains (1984???2010). Landscape Ecology 31:2367–2383.

McGarigal, K., A. Cushman, and E. Ene. 2012. FRAGSTATS v4: Spatial patterns analysis program for categorical and continuous maps. Univeristy of Massachusetts, Amherst, Massachusetts.

Meidinger, D. V, and J. Pojar. 1991. Ecosystems of British Columbia. Page B.C.

Miller, J. D., and A. E. Thode. 2007. Quantifying burn severity in a heterogeneous landscape with a relative version of the delta Normalized Burn Ratio (dNBR). Remote Sensing of Environment 109:66–80.

Reilly, M. J., C. J. Dunn, G. W. Meigs, T. A. Spies, R. E. Kennedy, J. D. Bailey, and K. Briggs. 2017. Contemporary patterns of fire extent and severity in forests of the Pacific Northwest, USA (1985-2010). Ecosphere 8:e01695.

Integrating Mobile and UAV-based Lidar Point Clouds to Estimate Forest Inventory

Filed under: 2017,My Spatial Problem @ 8:49 am

1. The research question I’m exploring is: “What is the relationship do spectral reflectance signatures (across 5 multispectral bands) of southwestern white pine (P. strobiformis) seedlings in common garden boxes have with seedling position within the boxes or position within the plots?”

2. My raw data consist of 500 photos taken from a UAV of common garden boxes in northeastern Arizona. I used Agisoft Photoscan to compile the images into a 5 layer orthomosaic (Image 1) where each layer represents a discrete band from the multispectral sensor (Table 1). One of the 20 boxes was blurred in the reconstruction and omitted from subsequent processing leaving 19.


Image 1: Orthomosaic image of common garden boxes containing southwestern white pine (P. strobiformis) seedlings in Kaibab National Forest, Arizona, USA.

Table 1: Micasense Rededge multispectral sensor band designations and spectral spans.

Processed mosaic images are georeferenced using RTK GPS coordinates for targets which were arranged around perimeter of the AOI. Once georeferencing is completed, the stacked orthomosaic is exported from Photoscan as a TIFF file which can be viewed and manipulated further in ArcMap or R. Spectral indicies which inform plant physiology such as NDVI and TGI can easily be added using raster algebra (Image 2).

Image 2: Normalized Differential Vegetation Index (NDVI) layer for one box of seedlings

3. Based on visual analysis of NDVI there appears to be some patterns both at the box and plot levels. My hypothesis is that spatial statistics will reveal correlations between spectral reflective signatures of individual plants and the location of the plant within its box and/or within the plot as a whole.

4. After the NDVI and TGI layers are added to the data the real processing begins. Because plants are tightly spaced in some of the boxes, automatic segmentation of the individual seedlings was not an option. As a result, it is necessary to assign points at the center of all plants in the AOI (1697 seedlings from 19 boxes). Each seedling was assigned a ‘boxID’ field with a value corresponding to its box. Also, layers were created with points for the center of each box and for the point in the center of the whole plot.


Non vegetation features were masked from the data using an NDVI vegetation mask. All pixels within the boxes were classified into two categories using the NDVI layer (vegetation and non-vegetation). Pixels classified as vegetation within the boxes were exported as a separate raster and, Voila!, the mask was born (Image 3).

Image 3: Vegetation mask from NDVI layer for one box

The next postprocessing step is to create buffer zones around the plant points to which the plant pixels could be clipped. Rather than making the buffers uniform in size with some arbitrary diameter I wanted to use a more repeatable and defensible method. The solution I came up with was to tailor the buffer zones according to the distance to the closest neighboring point. The radii of the buffers were calculated as R= 0.45 * nn,  where nn is the distance to nearest neighbor (image 4).

Image 4: Plant buffer zones drawn according to 0.45 times the distance to the nearest neighboring point.

5, Although there will be many opportunities along the processing workflow to generate tables, figures, and images to help tell the story of this study, my main objective is quantifying the effects of box and plot position on mean canopy reflectance across the 5 bands and 2 spectral indices.  As a means for simplifying this, I plan to run PCA to try and consolidate the spectral data into a smaller number of principal components. The principal components can be compared to the spatial variables using multiple linear regression.

6. If this method works, it would be significant because genetic trials could be evaluated remotely. Common garden experiments are widely used for detecting phenotypes that are resistant to disease, drought, or pests. Traditionally, on-the-ground evaluations of plant health including subjective visual surveys are used to identify candidates for resistance. This method sets up a workflow for objectively and remotely classifying seedlings from common garden boxes and automates much of the segmentation of individual plant canopy pixels.

7. I have a strong foundation in ESRI ArcGIS and Agisoft Photoscan. In the past 9 months, I have become familiar with R for a variety of applications and plan to use it extensively in my analysis. One of the major issues I expect to deal with is related to the of the datasets I’m working with. The images and reconstructions can be 5-10GB in the raw form and tend to swell during processing. As a result, I would also like to use this class as an opportunity to think about how I can make more manageable files by extracting only the information that I’m interested in using and using geometric models to simplify highly detailed representations.

Sensitivity of Forest Structure Parameters Estimation to Unmanned Aircraft System (UAS) Data Processing Algorithms

Filed under: 2017,My Spatial Problem @ 12:40 am


The emission of Carbon Dioxide (CO2) is one of the greatest factors causing climate change. Forests, as the largest terrestrial carbon reservoir have an important role to reduce CO2 emissions and to prevent climate change. Considering the important role of forests to reduce CO2 emissions and prevent climate change, it is necessary to manage forests sustainably. In sustainable forest management, forest manager must frequently monitor and measure forest structure parameters, such as number of trees, tree locations, tree heights, crown width, biomass, and forest carbon. However, these forest structure parameter, including forest carbon, are usually measured through conventional field measurements, which are costly and labor intensive. New technology, such as Unmanned Aircraft Systems (UAS) can be used as alternative methods to monitor and measure forest structure parameters. There are some programs designed to process UAS visual imagery data and measure forest structure parameters from the data, for example Agisoft PhotoScan, PiX4D, Fusion, Trevaw, TrEx, and Wathershed.

Research question:

The purpose of my research is to evaluate several programs that are designed to measure forest structure parameters, such as number of trees, tree locations, diameter breast height (DBH), tree heights, and crown width which are needed to estimate biomass and forest carbon.  I will base my comparisons against manually segmented tree inventory counts, measured using Fusion software, that are randomly sampled from a hierarchical design. The research will answer questions: (1). How does the accuracy differ between data processing programs to measure forest structures, biomass, and carbon?; (2). What is the best program and model to measure forest structures, biomass, and carbon?. However, for the purpose of this class, I might eliminate some forest structure parameters and focus on parameters number of trees and tree locations.


I hypothesize that all software used in this study can be used to measure forest structure parameters and to estimate forest biomass and carbon, but all of the software have different accuracy due to different algorithms used in each one of the software.


This research will use Unmanned Aircraft System (UAS) visual imagery data that is taken from Sony camera NEX-5R mounted in a fix wing UAS. The data was gathered in Siberut National Park in West Sumatra Province, Indonesia. There are 3 flights with 300 images in each flight. The image resolution is 350 x 350 inch. I will discuss it with my committee members whether or not I should use all the images. I think I will focus on one flight for this class.


This study will use a modeling approach to measure forest structure parameters based on Unmanned Aircraft System (UAS) data. The sampling plots will be selected hierarchically using ArcGIS. The 3D point clouds will be generated using Agisoft PhotoScan and PiX4d. Then, based on the 3D point clouds, forest structure parameters, which include number of trees, tree locations, diameter breast height (DBH), tree heights, and crown width, will be measured using some forest data processing programs (Fusion, TreVaw, TrEx, and Watershed). In addition, I will develop allometric equations and use those equations to statistically measure biomass and carbon based on the tree heights and diameter breast height (DBH) or canopy width. The allometric equations will be based on the linear relationship between forest structure parameters (tree heights, DBH, and canopy width), forest biomass, and forest carbon. I will use RStudio to do the statistical analysis (RMSE and Linear Regression Model). Again, for the purpose of this class I will only focus on number of trees and tree locations. I also will not use all the programs for this class, Maybe I will focus on ArcGIS, Agisoft, Fusion, and TreVaw.

Image 1. The example of hierarchically selected sampling plots using ArcGIS.

Image 2. The example of 3D point cloud generating process in Agisoft PhotoScan.


Expected outcome:

If the hypothesis is true, then the accuracy of each program will be different due to different algorithm. There will be one best program and model compared to manual segmentation using Fusion. However, all the automatic segmentations will not be too different compared to manual segmentation (will be proven by statistical method).


This research will assess and propose UAS as a low cost alternative method for measuring forest structure parameters. Forest structure parameters are usually measured by forest managers using conventional methods of field ground measurements. These conventional methods are costly and labor intensive. By using UAS, forest managers will be able to monitor forest structure parameters in a faster and cheaper way. Furthermore, this research will assess and find the best program and model to estimate forest structure parameters through automatic segmentation. This new method will be very useful, especially when it is used in remote areas with limited accessibility, where it is almost impossible to do field ground measurements.

Level of Preparation:

I have some experience in using Arc GIS but it is just basic thing, and I need to learn more about this software. I used RStudio in two statistic classes (ST 511 and ST 512), and now I am taking ST 513. For other software that I will use in this research (Agisoft PhotoScan, PiX4D, Fusion, TreVaw, TrEx, and Watershed), I do not have much experience with all these software but I will learn how to use them.


April 6, 2017

Evaluating interspecific competition between wolves and cougar in northeast Oregon

Filed under: My Spatial Problem @ 3:36 pm

Background & Research Question(s)

The research I am conducing is focused on understanding and quantifying the competitive interactions between two apex predators, wolves and cougar. Populations of large carnivores have been expanding across portions of their historical range in North America, and sympatric wolves (Canis lupus) and cougars (Puma concolor) share habitat, home ranges, and prey resources (Kunkel et al. 1999, Husseman et al. 2003, Ruth 2004), suggesting these coexisting predators may be subject to the effects of interspecific competition (interference or exploitative). Interspecific competition among carnivores can affect the spatial distribution, demography, and population dynamics of the weaker predator (Lawton and Hassell 1981, Tilman 1986), but demonstrating asymmetric competition affects through quantified measures or experiments has remained difficult for highly mobile terrestrial mammals like large carnivores. Generally, it is expected that cougar are the subordinate competitor in wolf-cougar interactions, but the frequency and strength of agonistic interactions can be system specific and will determine the overall influence either predator has on the population dynamics of the community (Kortello et al. 2007, Atwood et al. 2009, Creel et al. 2001). My goal is to assess the influence wolf recolonization in northeast Oregon has had on cougar spatial dynamics. For this course, I plan to focus on two analyses associated with questions about how wolves influence the distribution of sites where cougar acquire food (prey) resources and how wolves influence cougar movement paths. More specifically, I hope to address:

  1. Is niche partitioning of kill sites evident between wolves and cougar in northeast Oregon?
  2. Has wolf presence/recolonization changed where cougar kill prey?
  3. Has wolf presence/recolonization changed cougar movement dynamics?


The data I’ll be using to address the above questions comes from location points downloaded from global positioning system (GPS) collars that were deployed on a sample of wolves and cougar in northeast Oregon (Figure 1). Collars were programmed to obtain 8 locations per day and were downloaded very 2-3 weeks to identify location “clusters” for field investigation as potential kill sites. After identifying “clusters” from location data for each carnivore, we loaded the coordinates for the clusters onto handheld GPS units and systematically searched surrounding areas for prey remains. When prey remains were located, we recorded the spatial coordinates of the kill site and determined the species, age, sex and physical condition of the prey item that was killed. This process was done before (2009 – 2012) and after (2014 – 2016) wolves recolonized a 1, 992km2 area (Mt. Emily WMU) and produced a data set of kill site locations for cougar (npre-wolf = 1,213; npost-wolf = 541) and wolves (n = 158).


Figure 1. Location of the Mt. Emily Wildlife Management Unit in northeast Oregon and global positioning system (GPS) locations for cougars monitored to determine kill site distribution and spatial dynamics pre- (2009-2012) and post-wolf recolonization (2014-2016).


Based on competition theory and emerging evidence on wolf-cougar interactions in other systems (Alexander et al. 2006, Kortello et al. 2007, Atwood et al. 2009), I expect cougar and wolves in northeastern Oregon to exhibit resource partitioning in foraging niche (resource partitioning hypothesis). Representative evidence for resource partitioning between carnivores would be if cougar kill sites occur on the landscape in areas disparate from wolf kill sites. Further, I expect the presence of wolves to affect cougar movement patterns and spatial distribution, which may be evident through a shift in the distribution and space used by cougar between pre- and post-wolf recolonization periods (niche shift [competitive exclusion] hypothesis). Under this premise, I would also expect cougar to alter their movement and space use relative to pre-wolf recolonization patterns (active avoidance hypothesis).

  1. Is niche partitioning of kill sites evident between wolves and cougar in northeast Oregon?

H1: Resource partitioning hypothesis – cougar kill site distribution on the landscape will occur spatially partitioned from wolf kill sites to avoid interference competition with wolves.

  1. Has wolf presence/recolonization changed where cougar kill prey?

H2: Niche shift (competitive exclusion) hypothesis – cougar may demonstrate altered foraging niche (prey acquisition sites will begin to occur on steeper slopes) and prey species selection (increased numbers of mule deer) between time periods with and without wolf presence.

  1. Has wolf presence/recolonization changed cougar movement dynamics?

H3: Active avoidance hypothesis – cougar will alter their spatial distribution and movement paths to avoid interference competition with wolves.


I will be using ArcMap to visualize data and extract site-specific habitat features for kill sites, but am also interested in avenues to address my questions I may not have thought of. I plan to examine changes in cougar kill site characteristics in a logistic regression framework via latent selection difference (LSD) functions (Czetwertynski 2007, Latham et al. 2013). LSDs allow for direct comparison of habitat selection between two groups of interest to contrast the difference in selection through quantifiable measurements of relationship strengths. The model takes the form,

w(x) = exp(β1x1 + β2x2 + … + βixi)

where w(x) represents the relative probability of wolves (coded as 1) occurring on the landscape compared to cougar (coded as 0) thus using predator species as the dependent variable. The selection coefficient βi is represented for each predictor variable (xi) from a vector of covariates (x) and is interpreted as the relative difference in selection between wolves and cougar, not the selection or use of a given habitat (Czetwertynski 2007). I will be using the lme4 package (Bates et al. 2011) in program R to estimate coefficients contrasting the differences between cougar and wolves, and between cougars pre- and post-wolf recolonization as it relates to kill site habitat features. For the first LSD analysis, known cougar and wolf kill sites will be regressed to evaluate the relative difference in selection for features explicit to these sites (Table 1). For the second LSD analysis, pre- (coded as 1) and post-wolf (coded as 0) cougar kill sites will be regressed to evaluate the relative selection differences in cougar kill sites as it relates to wolf presence.

I also plan to evaluate cougar movement (step length and turn angles) for differences between pre- and post-wolf time periods (question 3) using Geospatial Modelling Environment (GME version 7.2.0, Beyer 2012) and program R. I will also use R to summarize this data (dplyr, MASS, car packages) and evaluate/test for differences using ANOVA and/or non-parametric tests.

Table 1. Landscape structure characteristics of interest for use in regression and latent selection difference (LSD) function modeling in relation to wolf-cougar prey kill site occurrence in northeast Oregon.

Expected Outcome

I plan to produce visuals (graphs/tables/maps) that depict the statistical relationships between:

  • Wolf and cougar kill sites
  • Cougar kill sites pre- and post-wolf recolonization
  • Cougar movement pre- and post-wolf recolonization


Predation is recognized as a major factor influencing population dynamics in which the direct and indirect effects of predators can influence community level dynamics (Terborgh and Estes 2010, Estes et al. 2011). Predation effects on prey populations are tied to the complexities of intraguild dynamics, as the predation risk for shared prey can vary relative to the nature of predator-predator interactions as well as based on the behavioral responses of prey to predators (Atwood et al. 2009). In addition to addressing key ecological questions regarding predator-predator interactions, the results from this research will provide information on the effect of wolves on cougar populations, and potential effects of this expanded predator system on elk and mule deer populations. Knowledge gained from this study will be critical to effective management and conservation of cougar in Oregon and could be useful to other parts of western North America facing similar changes in community dynamics as wolves continue to expand their range.


  • ArcInfo: intermediate/advanced, comfortable with all ESRI products and finding independent sources where I do have knowledge gaps
  • Modelbuilder and Python: novice, have used both but limited knowledge of their strengths and limitations
  • R: intermediate, comfortable manipulating data, using base statistical packages for descriptive stats and more advanced regression modeling
  • Other: GME (intermediate), QGIS (novice)


Atwood, T.C., E.M. Gese, and K.E. Kunkel. 2009. Spatial partitioning of predation risk in a multiple predator-multiple prey system. Journal of Wildlife Management 73:876-884.

Beyer, H. 2012. Geospatial Modelling Environment (software version

Bates, D., M. Maechler, and B. Bolker. 2011. lme4: linear-mixed-effects models using S4 classes. Version 0.999375-42. Available online:

Estes, J.A., J. Terborgh, J.S. Brashares, M.E. Power, J. Berger, W.J. Bond, S.R. Carpenter, T.E. Essington, R.D. Holt, J.B.C. Jackson, R.J. Marquis, L. Oksanen, T. Oksanen, R.T. Paine, E.K. Pikitch, W.J. Ripple, S.A. Sandin, M. Scheffer, T.W. Schoener, J.B. Shurin, A.R.E. Sinclaire, M.E. Soule, R. Virtanen, and D.A. Wardle. 2011. Trophic downgrading of planet Earth. Science 333:301-306.

Creel, S., G. Sprong and N. Creel. 2001. Interspecific competition and the population biology of extinction-prone carnivores. Pages 35-60 in J.J. Gittleman, S.M. Funk, D. Macdonald, and R.K. Wayne (eds). Carnivore Conservation. Cambridge University Press, Cambridge, UK.

Czetwertynski, S.M. 2007. Effects of hunting on the demographics, movement, and habitat selection of American black bears (Ursus americanus. Ph.D. thesis, Department of Renewable Resources, University of Alberta, Edmonton, Canada.

Husseman, J.S., D.L. Murray, G. Power, C.M. Mack, and C.R. Wegner, and H. Quigley. 2003. Assessing differential prey selection patterns between two sympatric carnivores. Oikos 101:591-601.

Kortello, A.D., T.E. Hurd, and D.L. Murray. 2007. Interactions between cougar (Puma concolor) and gray wolves (Canis lupus) in Banff National Park, Alberta. Ecoscience 14:214-222.

Kunkel, K.E. and D. H. Pletscher. 1999. Species-specific population dynamics of cervids in a mulitpredator ecosystem. Journal of Wildlife Management 63:1082-1093.

Latham, A.D.M., M.C. Latham, M.S. Boyce, and S. Boutin. 2013. Spatial relationships of sympatric wolves (Canis lupus) and coyotes (C. latrans) with woodland caribou (Rangifer tarandus caribou) during calving season in a human-modified boreal landscape. Wildlife Research 40:250-260.

Lawton, J.H. and M.P. Hassell. 1981. Asymmetrical competition in insects. Nature 289:793-795.

Ruth, T.K. 2004. Patterns of resource use among cougars and wolves in northwestern Montana and southeastern British Columbia. Ph.D. dissertation, University of Idaho, Moscow, ID, USA.

Terborgh, J. and J.A. Estes. 2010. Trophic cascades: predators, prey, and the changing dynamics of nature. Washington, DC: Island Press.

Tilman, D. 1986. Resources, competition and the dynamics of plant communities. Pages 51-74 in M.J. Crawley ed. Plant Ecology. Oxford: Blackwell Scientific Publications.

Amphibian Disease in the Cascades Range, OR

Filed under: My Spatial Problem @ 1:06 pm

Carson Lillard
My Spatial Problem


Ranavirus (Rv) and chytrid fungus (Bd) are both deadly pathogens that infect amphibian hosts all over the world. Amphibian die-offs associated with both pathogens have occurred throughout North America, but not much is known about ranavirus in Oregon due to most research being concentrated on Bd. I plan to sample for ranavirus in the Cascades Range, OR. Therefore, one of my objectives is to detect the presence or absence of ranavirus in Oregon, starting in the Cascades Range via eDNA. In preparation, I would like to be able to create an occurrence probability map for ranavirus to determine sites that may contain the virus throughout Oregon to sample. However, I am not sure if needed information is available due to the lack of research on life history of the virus. After data is collected, I would like to assess influence of spatial patterns related to geography, land-use type, pathogen movement, and climate on the distribution of the two diseases, ranavirus and chytrid fungus.

  • Is ranavirus present in Oregon and at what scale?
  • Can ranavirus presence be predicted using statistical methods and life history data?
  • What kind of spatial patterns are associated with ranavirus and chytrid fungus presence, movement, and spread in the Cascades Range?


Spatial patterns related to geography, land use type, pathogen movement, and climate will affect the presence of ranavirus and chytrid fungus throughout the Cascades Range, OR.

Ranavirus will be more prevalent in areas that see high cattle and/or visitor use.

Ranavirus and chytrid will not be as prevalent in geographically isolated regions.

Bd and ranavirus will be more prevalent at sites where Pseudacris regilla (Pacific chorus frog) exist due to higher mobility.

Ranavirus and chytrid presence and load will vary spatially due to climate, temperature, and elevation.

With a warming climate, pathogens will spread to high elevation ecosystems (if they aren’t already there) and amphibian species will be more at risk of infection in those ecosystems.


I would like to learn how to map or predict occurrence of pathogens, and understand all the moving parts needed to do so. I would also like to learn the best way to represent and interpret spatial data related to pathogens since they are very complex and context-dependent. I am mainly interested in using ArcGIS programs or learn about other software that are more tailored to my question.

Expected Outcome
I would like to ultimately learn about statistical methods leading to statistical relationships based on pathogen occurrence and spatial patterns as mentioned above. If enough information is available, I would like to produce a map predicting ranavirus occurrence probability in Oregon, or globally, such as the map below showing predicted global Bd occurrence from Xie et al. 2016.


As previously stated, Ranavirus and chytrid fungus are both deadly pathogens that infect amphibians all over the world. Amphibian die-offs associated with both pathogens have occurred throughout North America, but not much is known about ranavirus in Oregon due to most research being concentrated on Bd. Ranavirus die-offs are thought to have occurred in Oregon, implying that the virus is present but no samples have been taken. Knowing whether or not ranavirus is present, managers and researchers can make more educated decisions with this baseline data. Interpreting any spatial patterns would also be helpful in learning more about life history and eco-pathology of both ranavirus and chytrid fungus.


I am intermediate to proficient with ArcMap by taking GEOG 560 and 561. I am currently in GEOG 562 and learning Python. Some novice experience with Rstudio from a stats course.

An investigation of post-wildfire forest greening dependence on snowpack.

Filed under: 2017,My Spatial Problem @ 10:14 am

Research Question

My spatial question involves the relationship between winter snowpack and the following summer’s forest greening following a severe wildfire. Specifically, I would like to conduct a multivariate regression to see how important snowpack is to revegetation following wildfires in the Columbia River Basin, with other variables including summer precipitation, soil texture, and elevation. Additionally, I will look at how the snowpack-forest greening relationship persists following a wildfire on an annual timeframe.


To estimate snowpack, I’ll be using a new snow cover frequency (SCF) product, which uses satellite imagery reflectance data from the Moderate Resolution Imaging Spectroradiometer (MODIS) to measure the percentage of snow-covered days over a user-defined span of time. MODIS imagery is acquired daily and is available from the year 2000 to the present. The spatial resolution of MODIS imagery is 500 meters. This project will consider October 1 to May 1 to be the relevant timeframe for each year considered. So in the example below, the highest pixel value for SCF (0.58) corresponds to the interpretation that 58% of the valid days from October 1 to May 1 were snow-covered.

MODIS reflectance data will also be used to estimate forest greening. The Enhanced Vegetation Index is calculated using red, infrared, and blue wavelength bands to estimate canopy greenness, a quality which depends on leaf area, chlorophyll content, and canopy structure. EVI images are readily available in the form of 16-day composites collected at a resolution of 500 m. For each summer season (June 1 – September 30), EVI data will be condensed across bands to create a maximum summer EVI image which will be utilized in the spatio-temporal analysis.

Wildfire burn perimeter and severity data will be obtained through the Monitoring Trends in Burn Severity (MTBS) project. Analysis will be constrained to wildfires within the CRB occurring over forested land cover with large areas of high burn severity. This data is available back to 1984, but will only be considered post-2000 due to the limiting data availability of MODIS imagery.

Other potential data sources include 800m resolution summer precipitation data from the PRISM climate group, 30m resolution elevation data from NASA’s Shuttle Radar Topography Mission, and vector soil texture data from the State Soil Geographic dataset.


The primary relationship driving this study is that between winter snowpack and the following summer’s revegetation following a severe wildfire. The first hypothesis of this project is that snowpack is the most important predictor variable for post-wildfire revegetation. Additionally, I expect that snowpack (estimated using SCF) is more strongly correlated with forest greening (estimated using EVI) directly following a severe wildfire, and that the strength of this correlation decreases with time following the wildfire.


Through this course, I hope to explore methods for conducting a multivariate regression across space. All of my data except the soil texture data is raster format, so I will convert that to raster to make the regression analysis more approachable. If there is time, I would also like to analyze the snowpack-greening relationship a non-burned plot of land as a control example. I expect to do most of my mapping in ArcMap, and the regression analysis potentially in R.

Expected outcomes:

  • Plots showing the R2 values (y-axis) for each wildfire over time (x-axis)
  • Maps of the selected wildfire(s), including their burn perimeter/severity, EVI, and SCF


Snow accumulation has already been shown to influence peak summer forest greenness, especially at moderate elevations (Trujillo et al. 2012). The post-wildfire relationship between snowpack and forest revegetation is critical to understand as current trends of increasing temperatures, more ephemeral snowpack and intensifying wildfire activity are all forecasted to continue. Consequently, an expanding area of western mountain regions will be undergoing disturbance, revegetation and successional growth.

Forest managers and watershed managers may find the analysis of this research useful in preparing for future climate regimes. As wildfires continue to become more prevalent, having a comprehensive understanding of the ecological impacts of such disturbances will be critical for effective management of post-wildfire landscapes.

The ecological implications of this research are multi-faceted, especially regarding the changing climate affecting western U.S. mountains. Forests in the CRB are significant contributors to carbon sequestration, as western forests are responsible for 20-40% of total carbon sequestration in the contiguous U.S. (Schimel et al. 2002; Pacala et al. 2001). Depending on how successful forests revegetate following wildfires, western forests’ role as carbon sinks versus carbon sources may become more uncertain in future climate scenarios.

Levels of experience:

ArcGIS: high

Modelbuilder/Python: low, but learning this term in Geog 562

R: low (limited to STAT 511 usage)

Other: MATLAB (only changing parameter values, not creating from scratch)


Pacala, S.W., G.C. Hurtt, D. Baker, and P. Peylin. 2001. “Consistent Land- and Atmosphere-Based U.S. Carbon Sink Estimates.” Science 292: 2316–20. doi:10.1126/science.1057320.

Schimel, David, Timothy G. F. Kittel, Steven Running, Russell Monson, Andrew Turnipseed, and Dean Anderson. 2002. “Carbon Sequestration Studied in Western U.S. Mountains.” Eos, Transactions American Geophysical Union 83 (40): 445–49. doi:10.1029/2002EO000314.

Trujillo, Ernesto, Noah P. Molotch, Michael L. Goulden, Anne E. Kelly, and Roger C. Bales. 2012. “Elevation-Dependent Influence of Snow Accumulation on Forest Greening.” Nature Geoscience 5 (10): 705–9. doi:10.1038/ngeo1571.


April 5, 2017

My Spatial Problem

Filed under: 2017,My Spatial Problem @ 8:01 pm

This is the blog page for Spring 2017 – My Spatial Problem posts

Using radar to quantify water in forests

Filed under: 2017,My Spatial Problem @ 1:54 pm

My research focuses on using radar measurements to isolate the water content of a landscape. This will involve comparing measurements taken during wet periods and dry periods, and comparing the backscatter. The areas of interest will mostly be single-age monoculture forest stands, for simplicity in isolating backscatter due to trees (branches, trunks, etc.) from backscatter due to water.

The dataset I’m working with is the Sentinel-1 dataset from the European Space Agency. This dataset covers the whole Earth with 6-day repetition, and has been collecting data since 2014. On the ground, the resolution is anywhere from 5 m to 20 m, depending on the method of scan used and what pre- and post-processing has been done.

My hypothesis is that canopy density influences water interception expect the water content of heavily forested areas will be higher for longer periods than more open terrain (e.g. cropland, urban areas) or younger stands of forest.

For this data, it first needs to be converted from slant-range to ground-range in order to be analyzed by geostatistical software. Because of the geometry of radar acquisition, the product will not line up geographically without some processing. In addition, the values in each pixel (digital number) need to be converted to a meaningful number, such as backscatter.

Once the processing steps have been completed, I will then need to build a weather database to line up the pixels in the images with how “wet” that pixel was at time of acquisition, as well as the uncertainty in the weather variables. The radar data is measured across the whole landscape, but weather stations are point-measurements, and any variables measured (dew point, precipitation, temperature, etc.) will increase in error with increasing distance from the station. For this course, I will focus on building the weather maps for my area and time.

The weather map will most likely be of rain accumulation prior to time of radar data acquisition. By comparing a large number of radar data, my hope is to capture the full range of water levels for a more robust model.

The expected outcome is to produce a statistical relationship between age of forest (or type of forest) with water quantity (normalized for time since it rained last). I also would like to produce maps, since I will be doing most of the work using maps and analyzing maps.

This problem has the potential to inform managers concerned with water retention. If my prediction is correct and more dense forests retain water for longer, then it provides land managers with data on how dense or sparse they should plant if they want to take water interception and retention into account.

I have enough experience in ArcGIS to build my weather map, but I would like to learn how to better streamline the process by using Modelbuilder or other Python programming. I don’t have any experience with Python beyond exposure in classes that I have taken. I am pretty good at R on my own, and I know how to find the answers to most problems I come across via internet searches. I am getting more familiar with ENVI and SNAP, which is the native software for processing the Sentinel-1 radar data.


Assessing Ponderosa Pine Plantations in the Willamette Valley

Research Question

My research explores the extent and distribution of ponderosa pine (Pinus ponderosa) plantations in the Willamette Valley, and the understory plant communities associated with them. In this class, I will be substituting my own field data with FIA (forest inventory and analysis) data collected by the USFS. The data I am collecting and assessing includes information about the plantation environments (or plot conditions, for the sake of this class), and vegetation response. I am primarily interested in determining how ponderosa pine canopy density affects understory species cover and richness. 


Willamette Valley ponderosa pine (WVPP) is a race of ponderosa pine that is native to western Oregon. WVPP grew extensively in the Valley pre-white settlement, and could be found in open woodlands alongside Oregon white oak (Quercus garryana). Burning by indigenous peoples, such as the Kalapuya, maintained the open structure of the forest that supported the growth of WVPP. After settlers began suppressing these once-frequent fires, fire-intolerant species, such as Douglas-fir (Pseudotsuga menziesii) began encroaching upon the WVPP range. Settlers also made use of the pine, harvesting it in great quantities for homesteads and other building materials. Soon, only a few small islands of original WVPP remained. These islands were genetically weak, and the species was threatened by potential extinction. 

Very few studies have been conducted on WVPP, so little is known about the understory species associated with west-side ponderosa. I assume that the open ponderosa pine stands maintained by the Kalapuya had unique plant associations that are now nearly non-existent. However, with the quantity of pure ponderosa pine stands increasing in the Willamette Valley, I am interested in assessing these new forest types and exploring the understory plant associations growing within them.


I used a subset of the Forest Service’s massive FIA dataset to replicate what I imagine my own data will look like. The biggest difference between this dataset and my own future data is its geographic location. Because the FIA plots are gridded, and the proportion of land that falls within WV ponderosa pine plantations is so small, there are very few FIA plots in the Willamette Valley that are entirely composed of ponderosa pine. As a substitute, I used plots from Deschutes county, which has a much higher proportion of pure ponderosa forests than the Willamette Valley does. There were 98 plots in Deschutes county that had ponderosa overstories. I used plot and subplot data to find the associated location, inventory year, canopy cover, elevation, slope aspect, and understory percent cover based on growth form (bare soil, forb, grass, shrub, and seedling). I included the environmental variables elevation, slope, and aspect to help explain variation within understory cover that canopy cover cannot explain.

Ponderosa Pine Plot Dataset


I hypothesize that understory species associated with ponderosa pine forests are adapted to open canopies with abundant light availability; therefore, the greatest  percent coverage of all understory growth forms will be present in low-cover canopies. I expect that there will be considerable variance in the understory response that cannot be attributed to canopy conditions, because environmental factors and previous land use will strongly influence the understory composition. However, I hope that including the environmental variables in my correlation assessment will help to identify the influence that canopy cover does exert over the understory. 


I will assess the auto-correlation within my primary explanatory and response variables (canopy cover and understory percent cover). I will accomplish this through a Global Moran’s Index analysis in ArcMap.

I will also assess the correlation between understory percent cover and the array of explanatory variables in my dataset through a multiple linear regression in R.

Expected outcome

This project will provide information about the auto-correlation within my primarily variables, and will also provide figures describing the relationships between understory cover, canopy cover, and environmental variables, and the significance of each relationship. 


My goal for this understory assessment is to provide information to land managers who are interested in managing their plantations in an ecologically beneficial way. Understory communities are frequently overlooked in working forests, but understory species and structural diversity is the most significant element of a forest for many animal species. Understory plants also influence forest microclimates, erosion, and fire patterns. My hope for this project is to identify relationships between canopy cover and understory cover, and to provide this information to managers so that they can make more informed decisions about tree spacing and thinning to facilitate a healthy understory community.

Level of preparation

I have used ArcMap for a few years and am comfortable navigating the software. I have some experience in R, but have not thought much about how it could apply to my project. I have not used the other software mentioned in this class.

© 2017 GEOG 566   Powered by WordPress MU    Hosted by