Modules

Convert

convert.convert.convert(indir, proj, band_info, startdate, enddate, outdir=None, overwrite=False)

Compile features from multiple processings into one geodataframe

Parameters:

inlist (list) – List of files or geopandas.dataframe.DataFrame objects to merge
proj (str) – Projection identifier
band_info (list) – Band information
startdate (str) – Start date
enddate (str) – End date
outdir (str, optional) – Output file location as string
overwrite (bool, optional) – Flag to overwrite existing file

Returns:

all_gdf – Compiled goedataframe

Return type:

geopandas.dataframe.GeoDataFrame

convert.raster_to_vector.raster_to_vector(infile, proj, band_info, startdate, enddate, outfile=None, overwrite=False)

Convert raster to vector file with geopandas

Parameters:

infile (str) – Input file location as string
proj (str) – Projection identifier
band_info (list) – Band information
startdate (str) – Start date
enddate (str) –
End date outfile : str, optional

Output file location as string
overwrite (bool, optional) – Flag to overwrite existing file

Returns:

all_gdf – Converted vectors geodataframe

Return type:

geopandas.GeoDataFrame

Merge

merge.merge_vectors.merge_vectors(inlist, proj='EPSG:3413', outfile=None, overwrite=False)

Compile features from multiple processings into one geodataframe

Parameters:

inlist (list) – List of files or geopandas.dataframe.DataFrame objects to merge
proj (str, optional) – Projection identifier
outfile (str, optional) – Output file path to write files toall_gdf.to_file(outfile)
overwrite (bool, optional) – Flag to overwrite existing file

Returns:

all_gdf – Compiled goedataframe

Return type:

geopandas.dataframe.GeoDataFrame

Metadata

metadata.add_metadata.add_metadata(iml, names, regions, outfile=None, overwrite=False)

Add all metadata information to inventory

Parameters:

iml (geopandas.GeoDataFrame or str) – Inventory GeoDataFrame object or filepath
names (geopandas.GeoDataFrame or str) – Placenames database GeoDataFrame object or filepath
regions (geopandas.GeoDataFrame or str) – Regions identifier GeoDataFrame object or filepath
outfile (str) – Filepath for output to be saved to
overwrite (bool, optional) – Flag whether to overwrite existing file

Returns:

iml – Inventory GeoDataFrame with metadata

Return type:

geopandas.GeoDataFrame

metadata.assign_certainty.assign_certainty(gdf, search_names, scores, source='all_src')

Assign certainty score to geodataframe based on sources

Parameters:

gdf (geopandas.GeoDataFrame) – Vectors to assign certainty to
search_names (str) – Names of sources to count and determine certainty
scores (list) – List of scores of certainty
sources (str) – Column name of sources information

Returns:

gdf – Vectors with certainty metadata assigned

Return type:

geopandas.GeoDataFrame

metadata.assign_id.assign_id(gdf, col_name='lake_id')

Assign unique identification numbers to non-overlapping geometries in geodataframe

Parameters:

gdf (geopandas.GeoDataFrame) – Vectors to assign identification numbers to
col_name (str) – Column name to assign ID from

Returns:

gdf – Vectors with assigned IDs

Return type:

geopandas.GeoDataFrame

metadata.assign_sources.assign_sources(gdf, col_names=['lake_id', 'source'])

Assign source metadata to geodataframe, based on unique lake id and individual source information

Parameters:

gdf (geopandas.GeoDataFrame) – Vectors to assign sources to
col_names (list) – Column names to assign sources from

Returns:

gdf – Vectors with assigned sources

Return type:

geopandas.GeoDataFrame

stats

stats.method_stats.method_stats(infile1, infile2, outfile)

Calculate general statistics on a lake inventory file

Parameters:

infile1 (str) – File path to lake geodataframe
infile2 (str) – File path to basin (as polygons) geodataframe
outfile (str) – Outputted file name for general statistics

stats.reformat.aggregate(geofile, col_name='lake_id')

Generate areal statistics for aggregated geodataframe. Aggregation is determined from a given column name

Parameters:

geofile (gpd.GeoDataFrame) – Dataframe to perform aggregation statistics on
col_name (str, optional) – Column name to aggregate dataframe by. The default is “lake_id”

Returns:

agg_geofile – Aggregated dataframe with updated areal statistics

Return type:

gpd.GeoDataFrame

stats.reformat.centroids(geofile)

Generate centroids for geodataframe

Parameters:: geofile (gpd.GeoDataFrame) – Dataframe to obtain centroids for
Returns:: geofile – Dataframe with centroid information
Return type:: gpd.GeoDataFrame

test

class test.test.TestGrIML(methodName='runTest')

Bases: TestCase

Unittest for the GrIML post-processing workflow

create_sample_pointfile(filepath, num_features=5): Generate a synthetic GeoDataFrame with simple point geometries

create_sample_polyfile(filepath, num_features=5, side_length=1.0): Generate a synthetic GeoDataFrame with square polygon geometries

create_sample_raster(filepath): Generate a small synthetic raster file with three bands

setUp(): Set up temporary directories

tearDown(): Clean up temporary files

test_convert(): Test vector to raster conversion

test_filter(): Test vector filtering

test_merge(): Test vector merging

test_metadata(): Test metadata population

Modules

Convert

Filter

Merge

Metadata

stats

test