Hi - some answers/observations:
1. Yes - we supply data "as is" from the feeds we have contracted for license purposes, and we are aware of the issues, especially pre-2007 (though we should point out that these tend to be things like the all-weather flag rather than anything fundamental such as result, going etc). We are in the process of correcting these on a permanent basis, but we are working with the source data providers to do so rather than patching in what we think is the case (the truth is that all racing data has some errors). Having said that, if there are particular issues you are aware of that you would like to see addressed, please do use the http://answers.betwise.net/feedback address.
2. Typically users do some scripting (Perl, Python, R etc) to bash the data into the shape they want, in addition to using SQL on its own. On the SQL front, SQLWorkbench, Heidi and (our favourite) SequelPro are all good tools.
3. Though some conventional queries on the raw data can make a profit, there are very few of them, or they throw up few contenders. The key to finding profitable angles is to create derived variables from the raw data, then use these derived variables in combination with the conventional data.
For example, try creating rolling trainer strike rates (overall, by course, by distance and by race type) or sire statistics (split by the same categories) in a separate table that can be queried in combination with the raw data. Also, make use of some of the data elements that are unique to Smartform when doing this, such as betting movements and betting ranking.
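To make the derived-variable idea concrete, here is a minimal Python sketch of a running trainer strike rate. It is only an illustration: the field names (race_date, trainer, course, won) are made up for the example and are not the actual Smartform column names, and it computes a cumulative "to date" rate rather than a fixed-length window. The one important point it shows is that each run only sees results from earlier days, so the derived variable does not leak the outcome you are trying to predict.

```python
from collections import defaultdict


def add_prior_strike_rates(runs, key_fields=("trainer",)):
    """Attach the strike rate achieved BEFORE each run to that run.

    runs: list of dicts with hypothetical keys "race_date" (ISO string),
          "won" (bool) and whatever is named in key_fields.
    key_fields: grouping columns, e.g. ("trainer",) or ("trainer", "course").
    """
    runs = sorted(runs, key=lambda r: r["race_date"])  # stable sort by date
    totals = defaultdict(lambda: [0, 0])  # group key -> [wins, runs]
    out = []
    i = 0
    while i < len(runs):
        # Collect all runs on the same date so that same-day results
        # never count towards each other's "prior" figures.
        j = i
        while j < len(runs) and runs[j]["race_date"] == runs[i]["race_date"]:
            j += 1
        day = runs[i:j]
        for r in day:
            wins, n = totals[tuple(r[f] for f in key_fields)]
            rate = wins / n if n else None  # None = no history yet
            out.append({**r, "prior_runs": n, "prior_strike_rate": rate})
        # Only after the whole day is scored do we add its results.
        for r in day:
            counts = totals[tuple(r[f] for f in key_fields)]
            counts[0] += 1 if r["won"] else 0
            counts[1] += 1
        i = j
    return out


# Tiny made-up sample, purely for illustration.
sample = [
    {"race_date": "2012-01-01", "trainer": "A", "course": "Ascot", "won": True},
    {"race_date": "2012-01-02", "trainer": "A", "course": "Ascot", "won": False},
    {"race_date": "2012-01-03", "trainer": "A", "course": "York", "won": False},
    {"race_date": "2012-01-03", "trainer": "B", "course": "York", "won": True},
]

overall = add_prior_strike_rates(sample)
by_course = add_prior_strike_rates(sample, key_fields=("trainer", "course"))
```

The output rows could then be written to a separate table and joined back to the raw data on the race and runner identifiers. Passing different key_fields gives the by-course, by-distance or by-race-type versions, and the same function would work for sires by swapping the grouping column.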
Hope this helps