POST
|
OK, I made this from scratch using Google Refine. It's attached as a Python file because the forum does not allow uploads of text files larger than 12.5 KB. You'll need to change the extension to .dic and then add it via the Open From File dialog. I'm assuming you know how to do that.

Here is my process, if you're interested:

1. Visited the official USPS abbreviation list.
2. Copied everything in the Street Suffixes section.
3. Pasted it into MS Excel (to preserve the tables).
4. Brought the tables into Google Refine.
5. Did some transformations and deleted the unneeded first column (code is below).
6. Exported as space separated.
7. Used a text editor to replace triple double quotes with single double quotes.
8. QC'd the data. Found and deleted one duplicate (ANNEX) and one error (CT --> CTS); perhaps I'll notify USPS.
9. Uploaded here.

Google Refine code used:

[
  { "op": "core/column-move", "description": "Move column Commonly Used Street*Suffix or Abbreviation to position 0", "columnName": "Commonly Used Street*Suffix or Abbreviation", "index": 0 },
  { "op": "core/column-removal", "description": "Remove column Primary Street Suffix", "columnName": "Primary Street Suffix" },
  { "op": "core/row-removal", "description": "Remove rows", "engineConfig": { "facets": [ { "invert": false, "expression": "isBlank(value)", "selectError": false, "omitError": false, "selectBlank": false, "name": "Commonly Used Street*Suffix or Abbreviation", "omitBlank": false, "columnName": "Commonly Used Street*Suffix or Abbreviation", "type": "list", "selection": [ { "v": { "v": true, "l": "true" } } ] } ], "mode": "row-based" } },
  { "op": "core/text-transform", "description": "Text transform on cells in column Commonly Used Street*Suffix or Abbreviation using expression value.trim()", "engineConfig": { "facets": [ { "invert": false, "expression": "isBlank(value)", "selectError": false, "omitError": false, "selectBlank": false, "name": "Commonly Used Street*Suffix or Abbreviation", "omitBlank": false, "columnName": "Commonly Used Street*Suffix or Abbreviation", "type": "list", "selection": [ { "v": { "v": false, "l": "false" } } ] } ], "mode": "row-based" }, "columnName": "Commonly Used Street*Suffix or Abbreviation", "expression": "value.trim()", "onError": "keep-original", "repeat": false, "repeatCount": 10 },
  { "op": "core/text-transform", "description": "Text transform on cells in column Commonly Used Street*Suffix or Abbreviation using expression value.trim()", "engineConfig": { "facets": [], "mode": "row-based" }, "columnName": "Commonly Used Street*Suffix or Abbreviation", "expression": "value.trim()", "onError": "keep-original", "repeat": false, "repeatCount": 10 },
  { "op": "core/text-transform", "description": "Text transform on cells in column Commonly Used Street*Suffix or Abbreviation using expression value.replace(/\\s+/,' ')", "engineConfig": { "facets": [], "mode": "row-based" }, "columnName": "Commonly Used Street*Suffix or Abbreviation", "expression": "value.replace(/\\s+/,' ')", "onError": "keep-original", "repeat": false, "repeatCount": 10 },
  { "op": "core/text-transform", "description": "Text transform on cells in column Commonly Used Street*Suffix or Abbreviation using expression grel:value.replace(\"\\s\",value)", "engineConfig": { "facets": [], "mode": "row-based" }, "columnName": "Commonly Used Street*Suffix or Abbreviation", "expression": "grel:value.replace(\"\\s\",value)", "onError": "keep-original", "repeat": false, "repeatCount": 10 },
  { "op": "core/text-transform", "description": "Text transform on cells in column Commonly Used Street*Suffix or Abbreviation using expression grel:value.replace(\" \",value)", "engineConfig": { "facets": [], "mode": "row-based" }, "columnName": "Commonly Used Street*Suffix or Abbreviation", "expression": "grel:value.replace(\" \",value)", "onError": "keep-original", "repeat": false, "repeatCount": 10 }
]
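
For anyone without Google Refine at hand, the cleanup described in this post (drop the first column, remove rows with a blank "commonly used" value, trim, collapse runs of whitespace, drop duplicates, export space separated) could be approximated in plain Python. This is only a rough sketch, not the workflow actually used: the function names `clean_rows` and `to_space_separated` are hypothetical, the three-column row layout is an assumption about how the pasted table looks, the sample rows are illustrative, and the automatic de-duplication stands in for the manual QC step.

```python
import re

def clean_rows(rows):
    """Clean (primary, variant, standard) rows from the pasted suffix table.

    Assumed layout: column 0 is the primary suffix name (discarded),
    column 1 is the commonly used suffix or abbreviation,
    column 2 is the standard abbreviation.
    """
    seen = set()
    cleaned = []
    for _primary, variant, standard in rows:
        # Trim, then collapse any run of whitespace to a single space.
        variant = re.sub(r"\s+", " ", (variant or "").strip())
        standard = re.sub(r"\s+", " ", (standard or "").strip())
        if not variant:
            continue  # skip rows whose "commonly used" cell is blank
        if variant in seen:
            continue  # skip duplicates (done by hand in the original QC)
        seen.add(variant)
        cleaned.append((variant, standard))
    return cleaned

def to_space_separated(pairs):
    """Render cleaned pairs as one 'VARIANT STANDARD' line per entry."""
    return "\n".join(f"{variant} {standard}" for variant, standard in pairs)

# Illustrative sample rows only; the real input is the full USPS table.
sample = [
    ("ALLEY", " ALLEE ", "ALY"),
    ("", "ALLEY", "ALY"),
    ("", "", ""),
    ("ANEX", "ANEX", "ANX"),
    ("", "ANNEX", "ANX"),
    ("", "ANNEX", "ANX"),
]

if __name__ == "__main__":
    print(to_space_separated(clean_rows(sample)))
```

The output of a script like this would still need the same manual check against the USPS list before being renamed to .dic.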
08-03-2012
12:33 PM
POST
|
I was looking for it too, with no luck, so I'm just going to create it. Stand by.
08-03-2012
10:51 AM