Stop focusing on object-oriented programming (OOP). Yes, in Python everything is an object and it was built from the ground up to facilitate the object-oriented programming paradigm, but you don’t actually need to know a single thing about OOP to write cromulent programs in Python, and in fact the vast majority of people using Python don’t have a clue about what an object oriented program even looks like. Python supports every major programming paradigm—procedure, declarative, concurrent, functional, et cetera—and can even be used as an imperative tool from the IDLE or IPython shell, so you can literally just type in equations or logic statements and get an answer the way you would with Matlab or BASIC or test whether a particular operation will work.
For what you are doing, you don’t really need to know anything about the deep functionality of Python beyond being able to import modules (literally a single line command), use the builtin string functionality, list slicing and comparison operations, and basic file I/O using with open… and the csv library. If you wanted to get clever you could learn the Openpyxl to be able to directly import from and manipulate Excel spreadsheets or the Pandas library to create spreadsheet-like dataframes and perform complex manipulations far more easily and reliably than in Excel but if you are content with the manual labor of saving Excel files as a CSV there is no reason to even do that.
It is really not as scary as you are making it out to be in your head.
I process different kinds and formats of complex data all the time using Python (and regex), a lot of it extracted from random Fortran and C tools where people just made up whatever format they liked, or people who think it is clever to dump time history data with different lengths, formats, and sample rates all in to one giant Excel spreadsheet. While it can sometimes be tricky to come up with a regex statement or a string command that correctly parse out the structure of the data, that is just an ability to build sufficient flexibility into the expression and have the code throw an exception when it encounters something that won’t parse.
I don’t do a lot of having to parse natural language data but pyparsing is purpose designed for that use. For Western-style street and mail addresses that do have a linear structure it could make short work of addresses that are just broken up across fields to parse out street name and number from city and state, and frankly if you poke around a bit you can probably find a utility that someone has written to do just that. (It would be hopeless with Japanese addresses, and I’m always thankful that in Japan every landline has an associated GPS coordinate position and most businesses provide mapcodes you can just punch into your rental car nav system).
There is really no reason to buy an expensive data analytics package to perform basic data formatting and processing, and frankly it probably is just going to make your job harder. The Salesforce rep is doubtless pitching it because he can then follow up offering you consluting services for the low low price of your left kidney and right testicle. The last thing you want to do is get locked into some kind of ‘enterprise software’ system that does a bunch of shit you will never need and requires expensive support to keep it functioning for your purpose.
Stranger