Sedona Examples

This repo demos Apache Sedona via tests and notebooks. It shows SedonaDB and SedonaSpark.

Here's how you can get up-and-running fast:

install the dependencies with pip install -r pyproject.toml
open Jupyter: jupyter lab

From there, you can easily run the notebooks.

You can also use the uv package manager to create your Python virtual environment. Here are the installation instructions with uv:

create an environment with uv venv
install the dependencies with uv pip install -f pyproject.toml
open Jupyter: uv run jupyter lab

Demo Apache Sedona Capabilities

Suppose you have the following table with the longitude/latitude for the starting and ending locations of trips:

+----------+---------+---------+---------+-------+-------+
|start_city|start_lon|start_lat| end_city|end_lon|end_lat|
+----------+---------+---------+---------+-------+-------+
|  New York|    -74.0|     40.8|Sao Paulo|  -46.6|  -23.5|
| Sao Paulo|    -46.6|    -23.5|      Rio|  -43.2|  -22.9|
+----------+---------+---------+---------+-------+-------+

Use Sedona to compute the distance in meters for each trip:

sql = """
SELECT 
    start_city,
    end_city,
    ST_DistanceSphere(
        ST_Point(start_lon, start_lat),
        ST_Point(end_lon, end_lat)) AS distance
FROM some_trips
"""
sedona.sql(sql).show()

Here's the result:


+----------+---------+-----------------+
|start_city| end_city|         distance|
+----------+---------+-----------------+
|  New York|Sao Paulo|7690115.108592422|
| Sao Paulo|      Rio|353827.8316554592|
+----------+---------+-----------------+

SedonaSpark examples

You can run the test suite as follows: uv run pytest tests.

Make sure to install on your machine to run the examples. You should also have Java installed (Java 17 works well).

Here's how to run the test suite with the printed output displayed: uv run pytest tests -s.

Unit testing Apache Sedona

A unit test compares an actual result to an expected outcome.

Let's see how to unit test the ST_Contains function.

Start by invoking the function:

sql = """
SELECT ST_Contains(
    ST_GeomFromWKT('POLYGON((175 150,20 40,50 60,125 100,175 150))'),
    ST_GeomFromWKT('POINT(174 149)')
) as result
"""
actual = sedona.sql(sql)

Now create an expected result and use chispa to confirm that the expected and actual DataFrames are the same:

expected = sedona.createDataFrame([(False,)], ["result"])
chispa.assert_df_equality(actual, expected)

This library provides an easy way to invoke the Sedona functions on your local machine!

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
data		data
images		images
notebooks		notebooks
sedona-examples		sedona-examples
tests		tests
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sedona Examples

Demo Apache Sedona Capabilities

SedonaSpark examples

Unit testing Apache Sedona

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

polygeo/sedona-examples

Folders and files

Latest commit

History

Repository files navigation

Sedona Examples

Demo Apache Sedona Capabilities

SedonaSpark examples

Unit testing Apache Sedona

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Packages