Notebook - Fractopo – KB7 Trace Data Validation
[1]:
import warnings
warnings.filterwarnings("ignore", message="The Shapely GEOS")
warnings.filterwarnings("ignore", message="In a future version, ")
warnings.filterwarnings("ignore", message="No data for colormapping provided via")
warnings.filterwarnings(
"ignore", message="Shapely 2.0 is installed, but because PyGEOS is also installed"
)
[2]:
from pathlib import Path
import geopandas as gpd
from fractopo import Validation
import matplotlib.pyplot as plt
Data (KB7)
[3]:
traces_path = Path("../../tests/sample_data/KB7/KB7_traces.geojson")
area_path = Path("../../tests/sample_data/KB7/KB7_area.geojson")
traces = gpd.read_file(traces_path)
area = gpd.read_file(area_path)
# Name the dataset
name = "KB7"
Validation (KB7)
[4]:
# Create validation object with fixing (i.e. modification of data) allowed.
# AREA_EDGE_SNAP_MULTIPLIER is overridden to keep catching this error even with future default
# value changes
kb7_validation = Validation(
traces, area, name=name, allow_fix=True, AREA_EDGE_SNAP_MULTIPLIER=2.5
)
[5]:
# Run actual validation and capture the outputted validated trace GeoDataFrame
kb7_validated = kb7_validation.run_validation()
Validation results (KB7)
[6]:
# Normal DataFrame methods are available for data inspection
kb7_validated.columns
[6]:
Index(['Name', 'Shape_Leng', 'geometry', 'VALIDATION_ERRORS'], dtype='object')
[7]:
# Convert column data to string to allow hashing and return all unique
# validation errors.
kb7_validated["VALIDATION_ERRORS"].astype(str).unique()
[7]:
array(['()', "('STACKED TRACES',)", "('MULTI JUNCTION',)",
"('MULTI JUNCTION', 'V NODE')", "('SHARP TURNS',)",
"('TRACE UNDERLAPS TARGET AREA',)"], dtype=object)
[8]:
# Better description function is found in fractopo.cli
from fractopo.cli import describe_results
describe_results(kb7_validated, kb7_validation.ERROR_COLUMN)
Out of 240 traces, 8 were invalid.
There were 5 error types. These were: STACKED TRACES V NODE MULTI JUNCTION SHARP TURNS TRACE UNDERLAPS TARGET AREA
The KB7 dataset contains the above errors of which MULTI JUNCTION
and TRACE UNDERLAPS TARGET AREA
are disruptive in further analysis.
See documentation: https://nialov.github.io/fractopo/validation/errors.html
Visualization of errors in notebook
Though visualization here is possible, GIS-software (e.g. QGIS, ArcGIS) are much more interactive and are recommended for actual fixing and further error inspection.
MULTI JUNCTION
[9]:
# Find MULTI JUNCTION erroneous traces in GeoDataFrame
kb7_multijunctions = kb7_validated.loc[
["MULTI JUNCTION" in err for err in kb7_validated[kb7_validation.ERROR_COLUMN]]
]
kb7_multijunctions
[9]:
Name | Shape_Leng | geometry | VALIDATION_ERRORS | |
---|---|---|---|---|
168 | None | 4.985900 | LINESTRING (466023.424 6692098.176, 466024.252... | (MULTI JUNCTION,) |
169 | None | 5.232445 | LINESTRING (466025.131 6692096.856, 466026.65 ... | (MULTI JUNCTION,) |
171 | None | 1.541532 | LINESTRING (466024.02 6692098.386, 466023.703 ... | (MULTI JUNCTION, V NODE) |
206 | None | 1.864415 | LINESTRING (466025.7 6692097.183, 466025.858 6... | (MULTI JUNCTION,) |
219 | None | 0.619474 | LINESTRING (466023.571 6692097.978, 466023.802... | (MULTI JUNCTION, V NODE) |
[10]:
kb7_multijunctions.plot(colors=["red", "black", "blue", "orange", "green"])
[10]:
<Axes: >
The plot shows that the green and blue traces abut at their endpoints which is not a valid topology for traces. The fix is done by merging the green and blue traces.
Additionally the orange trace has a dangling end instead of being accurately snapped to the black trace.
[11]:
# Example fix for blue and green traces
from shapely.ops import linemerge
gpd.GeoSeries(
[
linemerge(
[kb7_multijunctions.geometry.iloc[4], kb7_multijunctions.geometry.iloc[2]]
),
kb7_multijunctions.geometry.iloc[0],
]
).plot(colors=["green", "red"])
[11]:
<Axes: >
TRACE UNDERLAPS TARGET AREA
[12]:
# Find TRACE UNDERLAPS TARGET AREA erroneous traces in GeoDataFrame
kb7_underlaps = kb7_validated.loc[
[
"TRACE UNDERLAPS TARGET AREA" in err
for err in kb7_validated[kb7_validation.ERROR_COLUMN]
]
]
kb7_underlaps
[12]:
Name | Shape_Leng | geometry | VALIDATION_ERRORS | |
---|---|---|---|---|
207 | None | 0.679687 | LINESTRING (466025.246 6692096.378, 466025.377... | (TRACE UNDERLAPS TARGET AREA,) |
[13]:
# Create figure, ax base
fig, ax = plt.subplots()
# Plot the underlapping trace along with the trace area boundary
kb7_underlaps.plot(ax=ax, color="red")
area.boundary.plot(ax=ax, color="black")
# Get trace bounds
minx, miny, maxx, maxy = kb7_underlaps.total_bounds
ax.set_xlim(minx - 0.5, maxx + 0.5)
ax.set_ylim(miny - 0.5, maxy + 0.5)
[13]:
(6692095.263678445, 6692096.877513003)
The plot shows that the trace underlaps the target area at least on the northern end and maybe on the southern end. The fix is implemented by extending the trace to meet the target area boundary.
[ ]: