
NDVI Drone with SAM2 segmentation

In our previous blog post, OAK Cameras for NDVI Perception, we explored NDVI approaches and how to calculate NDVI using a multispectral camera.

Today, we are elevating (pun intended) NDVI perception to the next level by mounting a multispectral camera on a drone and using the SAM2 model for field segmentation and health comparison.

First: The hardware

We used the OAK-D-SR's PCBA and swapped one CCM (Compact Camera Module), so one sensor perceived the visible band (380-750 nm) while the other perceived the NIR band (>750 nm).

The OAK-D-SR was connected to a Raspberry Pi Zero 2 W, which captured and saved both frames every second. Both devices were powered by a power bank, and together with the DJI Mini 2 SE drone (249 g), the whole setup weighed 386 g.
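For reference, the capture script on the Raspberry Pi can be as simple as the sketch below. It assumes the DepthAI v2 API with both CCMs exposed as ColorCamera nodes on sockets CAM_B and CAM_C; the sockets, resolution, and file naming are assumptions, not our exact flight setup:

import time
import cv2
import depthai as dai

# Build a pipeline with one ColorCamera + XLink output per CCM
# (assumption: both modules sit on CAM_B/CAM_C - adjust for your board)
pipeline = dai.Pipeline()
for socket, name in [(dai.CameraBoardSocket.CAM_B, "visible"),
                     (dai.CameraBoardSocket.CAM_C, "nir")]:
    cam = pipeline.create(dai.node.ColorCamera)
    cam.setBoardSocket(socket)
    xout = pipeline.create(dai.node.XLinkOut)
    xout.setStreamName(name)
    cam.isp.link(xout.input)

with dai.Device(pipeline) as device:
    queues = {name: device.getOutputQueue(name, maxSize=1, blocking=False)
              for name in ("visible", "nir")}
    i = 0
    while True:
        for name, q in queues.items():
            frame = q.get().getCvFrame()  # BGR numpy array
            cv2.imwrite(f"{name}_{i:05d}.png", frame)
        i += 1
        time.sleep(1)  # save one frame pair per second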

SAM2 segmentation

After recording left/right frames, I used the SAM2 demo app, uploaded the color video, and selected the fields we were interested in (the demo only supports 3 objects at a time). After selecting the fields and running the model, you can find the segmentation results in the browser's Networking tab, under the "propagate_in_video" request. I saved these results to a file, and later decoded and visualized them.

SAM2 results are RLE-encoded (run-length encoding), so they need to be decoded to get the masks. You can use pycocotools for this:

from pycocotools import mask as mask_utils
mask = mask_utils.decode(annotation["segmentation"])
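
Since the demo only tracks three fields per session, the full pipeline loads several of these saved responses. A minimal sketch of loading one response back (the filename and outer JSON layout are assumptions based on what I saved from the Networking tab; the "results", "object_id", and "mask" keys match what the demo code below consumes):

import json
from pycocotools import mask as mask_utils

# Load one saved "propagate_in_video" response (hypothetical filename)
with open("sam_output_0.json") as f:
    frames = json.load(f)

for frame in frames:
    for result in frame.get("results", []):
        rle = result["mask"]           # COCO-style RLE: {"size": [H, W], "counts": ...}
        mask = mask_utils.decode(rle)  # uint8 HxW array of 0s and 1s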

NDVI comparison

At the bottom of the demo you can see the NDVI comparison between the fields. Because NDVI is relative (not absolute), we can only use it to compare the health of the fields against each other.

Field 6 has the highest NDVI value, which is also evident from the colorized NDVI image - it's greener than the other fields.
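
For reference, the calc_ndvi helper used in the demo code below follows the standard formula NDVI = (NIR - RED) / (NIR + RED). A minimal sketch, assuming BGR frames from OpenCV and a single-channel (or collapsible) NIR frame; the exact channel handling in our helper may differ:

import numpy as np

def calc_ndvi(color: np.ndarray, ir: np.ndarray) -> np.ndarray:
    """Per-pixel NDVI = (NIR - RED) / (NIR + RED)."""
    red = color[..., 2].astype(np.float32)   # assumes BGR channel order
    nir = ir.astype(np.float32)
    if nir.ndim == 3:                        # collapse NIR to a single channel
        nir = nir.mean(axis=-1)
    return (nir - red) / (nir + red + 1e-6)  # epsilon avoids division by zero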

Visualization & Code

We're using Rerun for the visualization and OpenCV for image processing and contour extraction (for cleaner overlays).

Below is the main logic behind the demo. Full code can be found here, and full demo here (includes SAM results and videos).

import numpy as np
import rerun as rr
from pycocotools import mask as mask_utils

# Run & initialize the Rerun viewer
rr.init("NDVI w/ SAM2", spawn=True)

# Prepare the Rerun annotation context (class ID -> label & color)
annotation_context = [(0, "Background", (0, 0, 0, 0))]
for i, color in enumerate(colors):
    annotation_context.append((i + 1, f"Field {i + 1}", color))
    rr.log(
        f"NDVI_Average/Field{i + 1}",
        rr.SeriesLine(color=color, name=f"Field {i + 1}"),
    )
rr.log("Color", rr.AnnotationContext(annotation_context), timeless=True)

size = (800, 1280)
for frame_idx in range(len(sam_data[0])):
    rr.set_time_sequence("step", frame_idx)

    frames = get_all_frames()
    ndvi = calc_ndvi(frames["color"], frames["ir"])

    # Integer class IDs for the segmentation image (0 is Background)
    segmentations = np.zeros(size, dtype=np.uint8)
    for i, data in enumerate(get_sam_output(frame_idx)):
        for result in data.get("results", []):
            field_num = result["object_id"] + i * 3  # 3 segmentations per file
            # Decode the RLE mask
            mask = np.array(mask_utils.decode(result["mask"]), dtype=np.uint8)
            # Mark the field's pixels with its class ID
            segmentations[mask == 1] = field_num + 1

            line_strips = get_contours(mask)
            rr.log(
                f"Color/Contours{field_num + 1}",
                rr.LineStrips2D(line_strips, colors=colors[field_num]),
            )
            rr.log(
                f"NDVI/Color/Contours{field_num + 1}",
                rr.LineStrips2D(line_strips, colors=colors[field_num]),
            )

            mean_ndvi = np.mean(ndvi[mask == 1])
            rr.log(f"NDVI_Average/Field{field_num + 1}", rr.Scalar(mean_ndvi))

    # Frames are BGR (OpenCV); reverse the channel axis to get RGB for Rerun
    rr.log("Color/Image", rr.Image(frames["color"][..., ::-1]))
    rr.log("NDVI/Color", rr.Image(frames["ndvi_colorized"][..., ::-1]))
    rr.log("Color/Mask", rr.SegmentationImage(segmentations))

Potential improvements

An important thing to note is that NDVI is calculated per image view, not globally. This is why a field's average NDVI changes from frame to frame instead of being a constant number. To improve this, we could use the whole video (e.g., do image stitching) and calculate the NDVI over the entire area at once.
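As a starting point, OpenCV's high-level Stitcher can build such a panorama from the color frames. A sketch, assuming enough overlap between consecutive frames for feature matching (load_color_frames is a hypothetical helper):

import cv2

# Stitch a subset of color frames into one panorama
frames = load_color_frames()[::10]  # subsample to keep stitching tractable
stitcher = cv2.Stitcher_create()
status, panorama = stitcher.stitch(frames)
if status == cv2.Stitcher_OK:
    cv2.imwrite("field_panorama.png", panorama)

To compute NDVI on the panorama, the NIR frames would have to be warped with the same transforms, which the high-level Stitcher doesn't expose, so a feature-based homography pipeline would be the next step.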

Conclusion

We've shown how to use a drone with a multispectral camera to capture NDVI images and use the SAM2 model for field segmentation and health comparison. This approach can be used for various agricultural tasks, such as monitoring crop health, detecting diseases, and more.

If you have any comments or suggestions, let me know in the comments!🙂
