We are building a series of spatial ETL tools to help our clean up some of the data we have been receiving. I have run into a problem where each road segment name takes the following form:
"1500 W Saint George St"
"1400 W Saint George St"
"1300 W Saint George St"
Using the string seacher I am able to identify all the names that begin with a numeric character using ^[0-9]. The problem is when the matched names are retured (those starting with a numeric value), I am unable to retain the later portion of the string from the end of the numeric variable forward.
In addition to this, the numbers are not always in the same format, for example some street names are listed as such:
"-1 George St."
"500-600 George St"
"Nanaimo Ave"
My idea is to combine a series of string searchers to ensure that I capture all the variables that begin with symbols or numbers and then use the space following those features to seperate the name out.
Unfourtunately I have been unsucessfull in doing so.
Thanks in advance,
Nelson