Intro to Lidar Data - Earth analytics python course module

Welcome to the first lesson in the Intro to Lidar Data module. This tutorial covers the basic principles of LiDAR remote sensing and three commonly used data products: the digital elevation model, the digital surface model, and the canopy height model. Finally, it walks through opening lidar-derived raster data in Python.

LiDAR Background

Watch the videos below to better understand what lidar is and how a lidar system works.

The Story of LiDAR Data video

How LiDAR Works

Let’s Get Started - Key Concepts to Review

Why LiDAR

Scientists often need to characterize vegetation over large regions. Because they don’t have the resources to measure each individual tree, they use tools that can estimate key characteristics over large areas. These tools often use remote methods. Remote sensing means that scientists aren’t physically measuring things with their hands. Instead, they use sensors that capture information about a landscape and record measurements that can be used to estimate conditions and characteristics.

Conventional on-the-ground methods to measure trees are resource intensive and limit the amount of vegetation that can be characterized. Source: National Geographic.

To measure vegetation across large areas you need remote sensing methods that can collect many measurements, quickly, using automated sensors. These measurements can be used to estimate forest structure across larger areas.

LiDAR, or light detection and ranging (sometimes also referred to as active laser scanning), is one remote sensing method that can be used to map structure, including vegetation height, density and other characteristics, across a region. LiDAR directly measures the height and density of vegetation (and buildings and other objects) on the ground, making it an ideal tool for scientists studying vegetation over large areas.

LEFT: Remote sensing systems which measure energy that is naturally available are called passive sensors. RIGHT: Active sensors emit their own energy from a source on the instrument itself. Source: Natural Resources Canada.

Lidar is an Active Remote Sensing System

LiDAR is an active remote sensing system. An active system means that the system itself generates energy, in this case light, to measure things on the ground. In a LiDAR system, light is emitted from a rapidly firing laser. You can imagine a light quickly strobing from a laser light source. This light travels to the ground and reflects off of things like buildings and tree branches. The reflected light energy then returns to the LiDAR sensor where it is recorded.

A LiDAR system measures the time it takes for emitted light to travel to the ground and back. That time is used to calculate the distance traveled, and the distance traveled is then converted to elevation. These measurements are made using the key components of a lidar system, including a GPS that identifies the X, Y, Z location of the light energy and an Inertial Measurement Unit (IMU) that provides the orientation of the plane in the sky.
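The time-to-distance step can be illustrated with a few lines of Python. This is a minimal sketch, not how a real lidar processing chain works: it assumes the pulse points straight down, uses the speed of light in a vacuum, and ignores the GPS and IMU corrections described above. The function names and the example values are hypothetical.

```python
# Speed of light in a vacuum, in meters per second.
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def range_from_travel_time(round_trip_seconds):
    """Return the one-way distance (meters) between sensor and target.

    The pulse travels to the target and back, so the total distance
    is divided by two.
    """
    return SPEED_OF_LIGHT_M_PER_S * round_trip_seconds / 2.0


def elevation_from_range(sensor_elevation_m, range_m):
    """Return the target elevation (meters).

    Assumption: the pulse points straight down, so the target elevation
    is the sensor elevation minus the range. A real system also needs
    the scan angle and the aircraft orientation from the IMU.
    """
    return sensor_elevation_m - range_m


# Hypothetical example: a plane flying at 2600 m records a pulse that
# takes 6.671 microseconds to return.
round_trip = 6.671e-6
target_range = range_from_travel_time(round_trip)
target_elevation = elevation_from_range(2600.0, target_range)

print(f"Range to target: {target_range:.1f} m")
print(f"Target elevation: {target_elevation:.1f} m")
```

Because light travels so fast, travel times are measured in microseconds or less, which is why lidar sensors need very precise timing.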

How Light Energy Is Used to Measure Trees

Light energy is a collection of photons. As the photons that make up light move toward the ground, they hit objects such as branches on a tree. Some of the light reflects off of those objects and returns to the sensor.

If the object is small, and there are gaps surrounding it that allow light to pass through, some light continues down towards the ground. Because some photons reflect off of things like branches but others continue down towards the ground, multiple reflections may be recorded from one pulse of light.

The distribution of energy that returns to the sensor creates what is called a waveform. The amount of energy that returns to the LiDAR sensor is known as “intensity”. The areas where more photons or more light energy returns to the sensor create peaks in the distribution of energy. These peaks in the waveform often represent objects on the ground, such as a branch, a group of leaves or a building.

An example lidar waveform. Source: NEON, Boulder, CO.

How Scientists Use LiDAR Data

There are many different uses for LiDAR data.

LiDAR data have classically been used to derive high-resolution elevation data.

LiDAR data have historically been used to generate high resolution elevation datasets. Source: National Ecological Observatory Network.

LiDAR data have also been used to derive information about vegetation structure, including:

Canopy Height

Canopy Cover

Leaf Area Index

Vertical Forest Structure

Species identification (in less dense forests with high point density LiDAR)

Discrete vs. Full Waveform LiDAR

A waveform or distribution of light energy is what returns to the LiDAR sensor. However, this return may be recorded in two different ways.

A Discrete Return LiDAR System identifies the peaks in the waveform curve and records an individual (discrete) point at each peak location. These discrete points are called returns. A discrete system may record 1-4 (and sometimes more) returns from each laser pulse.
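To make the idea of turning a waveform into discrete returns more concrete, here is a minimal sketch that finds peaks in a list of intensity values. It is a simplified illustration only: real sensors use more sophisticated peak detection, and the `find_returns` function, the intensity threshold and the waveform values are all made up for this example.

```python
def find_returns(waveform, min_intensity):
    """Return the indices of peaks in a waveform.

    A sample counts as a peak when it is at or above min_intensity,
    larger than the sample before it, and at least as large as the
    sample after it. The first and last samples are never peaks.
    """
    peaks = []
    for i in range(1, len(waveform) - 1):
        value = waveform[i]
        if (
            value >= min_intensity
            and value > waveform[i - 1]
            and value >= waveform[i + 1]
        ):
            peaks.append(i)
    return peaks


# Hypothetical waveform: returned energy sampled at regular time steps.
# You might read it as a tree top, a lower branch and then the ground.
waveform = [0, 1, 4, 9, 5, 2, 3, 7, 3, 1, 2, 12, 15, 6, 1, 0]

returns = find_returns(waveform, min_intensity=5)
print(f"Number of returns: {len(returns)}")
for return_number, index in enumerate(returns, start=1):
    print(f"Return {return_number}: sample {index}, intensity {waveform[index]}")
```

A discrete return system keeps only these peak locations, while a full waveform system keeps the entire list of values.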

A Full Waveform LiDAR System records a distribution of returned light energy. Full waveform LiDAR data are thus more complex to process; however, they can often capture more information compared to discrete return LiDAR systems.

An example lidar waveform. Source: NEON.

LiDAR File Formats

Whether they are collected as discrete points or as full waveform, LiDAR data are most often available as discrete points. A collection of discrete return LiDAR points is known as a LiDAR point cloud.

The most commonly used file format for storing LiDAR point cloud data is the .las format. The .laz format is a highly compressed version of .las and is becoming more widely used.

LiDAR Data Attributes: X, Y, Z, Intensity and Classification

LiDAR data attributes can vary, depending upon how the data were collected and processed. You can determine what attributes are available for each lidar point by looking at the metadata.

All lidar data points will have:

X,Y location information: this determines the x,y coordinate location of the object that the lidar pulse (the light) reflected off of.

Z (elevation values): representing the elevation of the object that the lidar pulse reflected off of.

Most lidar data points will have:

Intensity: representing the amount of light energy recorded by the sensor.

Classified Lidar Point Clouds

Some LiDAR point cloud data will also be “classified”. Classification refers to tagging each point with the type of object that it reflected off of. So if a pulse reflects off a tree branch, you would assign it to the class “vegetation”. If the pulse reflects off the ground, you would assign it to the class “ground”. Classification of LiDAR point clouds is an additional processing step.

Some LiDAR products will be classified as “ground/non-ground”. Some datasets will be further processed to determine which points reflected off of buildings and other infrastructure. Some LiDAR data will be classified according to the vegetation type.
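The sketch below shows how classified points might be represented and filtered in Python. It is an illustration under a few assumptions: the points are made up, the `LidarPoint` structure and the `select_class` function are hypothetical, and the numeric class codes (2 for ground, 5 for high vegetation, 6 for building) follow the standard ASPRS classes used by the LAS format rather than anything defined in this lesson. Always check the metadata of your own dataset to see which classes it uses.

```python
from collections import namedtuple

# Each point stores the attributes described above.
LidarPoint = namedtuple(
    "LidarPoint", ["x", "y", "z", "intensity", "classification"]
)

# Class codes assumed from the standard ASPRS LAS classification.
GROUND = 2
HIGH_VEGETATION = 5
BUILDING = 6

# Hypothetical points: coordinates and elevation in meters.
points = [
    LidarPoint(472000.5, 4434000.2, 1680.1, 120, GROUND),
    LidarPoint(472000.9, 4434000.4, 1692.7, 45, HIGH_VEGETATION),
    LidarPoint(472001.3, 4434001.0, 1680.3, 110, GROUND),
    LidarPoint(472002.0, 4434001.5, 1686.0, 200, BUILDING),
]


def select_class(points, class_code):
    """Return only the points tagged with the given class code."""
    return [point for point in points if point.classification == class_code]


ground_points = select_class(points, GROUND)
non_ground_points = [p for p in points if p.classification != GROUND]

print(f"Ground points: {len(ground_points)}")
print(f"Non-ground points: {len(non_ground_points)}")
```

Separating ground from non-ground points like this is the kind of step that comes before building elevation products from a point cloud.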

What is a Data Product?

A data product is the data that are DERIVED from an instrument or from information collected on the ground. For instance, you may go out in the field and measure the heights of trees at 20 plots, and then calculate an average height per plot. The average value is DERIVED from the individual measurements that you collected in the field.
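The plot example above can be written as a short Python sketch. The plot names and tree heights are made up for illustration (and only three plots are shown rather than 20), and `mean_height_per_plot` is a hypothetical helper, not part of any library.

```python
from statistics import mean

# Hypothetical field measurements: tree heights in meters for each plot.
tree_heights_m = {
    "plot_01": [12.0, 15.0, 9.0],
    "plot_02": [20.0, 18.0],
    "plot_03": [7.5, 8.5, 9.5, 10.5],
}


def mean_height_per_plot(heights_by_plot):
    """Return a dictionary of the average tree height for each plot."""
    return {
        plot: mean(heights) for plot, heights in heights_by_plot.items()
    }


# The averages are the derived data product.
average_heights_m = mean_height_per_plot(tree_heights_m)

for plot, height in average_heights_m.items():
    print(f"{plot}: {height:.1f} m")
```

The individual tree heights are the raw measurements, and the dictionary of per-plot averages is the data product derived from them.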

When dealing with sensor data, the sensors often collect data in a format that needs to be processed in order to get usable values from it.