Add entity geocoding tool#8003
Open
ksuderman wants to merge 5 commits into
Open
Conversation
- Extracts location entities from NLP-annotated JSON (spaCy/Stanza) - Geocodes GPE, LOC, FAC, and ORG entities using Nominatim - Dual output: GeoJSON for interactive maps + tabular summary - Galaxy OpenLayers integration for map visualization - Configurable Nominatim server (public or self-hosted) - Entity type selection and deduplication - Pure Python with urllib.request (no geopy dependency) - Rate-limited public API or unlimited self-hosted options - Comprehensive tests and documentation - Enables spatial analysis of text corpora Tool: geocode_entities (v1.0.0+galaxy1) Categories: Text Manipulation, Natural Language Processing Citation: OpenStreetMap contributors
- Geocodes named entities from NER JSON to GeoJSON format - Uses Nominatim geocoding service for location resolution - Outputs interactive GeoJSON for map visualization in Galaxy - Works with JSON output from spaCy, Stanza, and CoreNLP NER - Includes entity aggregation and coordinate mapping Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>
RZ9082
reviewed
May 20, 2026
Member
RZ9082
left a comment
There was a problem hiding this comment.
This PR is adding two different directories with identical content! So please let's start by removing one of them.
Keep only tools/geocode/ to resolve duplicate tools issue
- Update profile from 21.05 to 24.1 - Remove macros.xml and inline version - Fix homepage_url and remote_repository_url to point to IUC repository - Add Galaxy copyright notice to Python script - Add ftype attributes and has_n_rows to test assertions
Author
|
Addressed review issues - removed duplicate tools/geocode_entities/ directory and applied standard IUC fixes |
Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>
bgruening
reviewed
May 20, 2026
| <tool id="geocode_entities" name="Geocode Named Entities" version="@TOOL_VERSION@+galaxy@VERSION_SUFFIX@" profile="24.1"> | ||
| <macros> | ||
| <token name="@TOOL_VERSION@">1.0.0</token> | ||
| <token name="@VERSION_SUFFIX@">1</token> |
Member
There was a problem hiding this comment.
Suggested change
| <token name="@VERSION_SUFFIX@">1</token> | |
| <token name="@VERSION_SUFFIX@">0</token> |
bgruening
reviewed
May 20, 2026
| <requirement type="package" version="3.12">python</requirement> | ||
| </requirements> | ||
| <command detect_errors="exit_code"><![CDATA[ | ||
| python '$__tool_directory__/geocode_entities.py' |
Member
There was a problem hiding this comment.
can you please add for all those files a https://docs.galaxyproject.org/en/latest/dev/schema.html#tool-required-files entry this will help in Pulsar settings.
bgruening
reviewed
May 20, 2026
| <param name="nominatim|source" value="public"/> | ||
| <output name="geojson_output" ftype="geojson"> | ||
| <assert_contents> | ||
| <has_text text="FeatureCollection"/> |
bgruening
reviewed
May 20, 2026
|
|
||
| ## Version History | ||
|
|
||
| - **1.0.0+galaxy1**: Initial release with GeoJSON output and OpenLayers integration |
Member
There was a problem hiding this comment.
this was never released, not sure if its useful to keep the history here
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Test plan
🤖 Generated with Claude Code