Perspectives on LedgerSMB: In defence of hand coded SQL

Sunday, August 18, 2013

In defence of hand coded SQL

A common reaction when I point out that I hand-write all my SQL queries rather than relying on an ORM or the like is that this is drudge work, obsoleted by modern tools. When I mention that these queries are usually wrapped in stored procedures, the reactions go from disdainful to horrified. This piece presents the other side: why I do this and why I find it works. I am not saying this approach is free of costs; software engineering is about tradeoffs, and these tradeoffs are real. My approach is not a magic bullet, but it forms a vital piece of how I build software on the database.

The first thing to note is that I use a lot of SELECT * FROM table queries when querying tables that match the output structure. We all run into tables that cannot reasonably be normalized further and whose structure can feed directly into the application. In a stored procedure wrapper, SELECT * reduces the maintenance points for such tables when new fields need to be added (the query still matches the specified return type with no modifications). This has a cost in that it discourages refactoring of tables down the road, but this just needs to be checked for. One can still have central management by using views if needed. Central management of type definitions is generally a good thing. Views can take the place of an ORM....
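As a rough illustration of the maintenance point above, here is a minimal sketch in Python, with SQLite standing in for the database. The account table, its columns, and the list_accounts wrapper are hypothetical names made up for this example, and the wrapper is only a stand-in for a stored procedure that returns the table's row type. It shows that a SELECT * wrapper picks up a newly added column without being edited.

```python
import sqlite3


def list_accounts(conn):
    """Hypothetical wrapper, standing in for a stored procedure.

    It uses SELECT * because the table already matches the output structure.
    """
    cur = conn.execute("SELECT * FROM account ORDER BY id")
    columns = [d[0] for d in cur.description]  # column names as returned
    return [dict(zip(columns, row)) for row in cur.fetchall()]


# Assumed schema, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account ("
    " id INTEGER PRIMARY KEY,"
    " accno TEXT NOT NULL,"
    " description TEXT)"
)
conn.execute("INSERT INTO account (accno, description) VALUES ('1000', 'Cash')")

before = list_accounts(conn)

# A new field is added to the table; the wrapper itself is not touched.
conn.execute("ALTER TABLE account ADD COLUMN obsolete INTEGER DEFAULT 0")

after = list_accounts(conn)

print(sorted(before[0]))
print(sorted(after[0]))
```

The only maintenance point here is the table definition. The flip side is the cost mentioned above: any later restructuring of the table changes what the wrapper returns, so callers need to be checked.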

The second point is that CRUD queries of this sort don't take significant time to write, even on a well-normalized database. Having them encapsulated behind a reasonably well-designed procedural interface is not a bad thing, provided that some of the classical difficulties of stored procedures are addressed.

I find that my overall development time is not slowed down by hand-writing SQL. This remains true even as the software matures. The time saved by automatic query tools comes at a price: one doesn't spend time thinking about how best to utilize queries in the application. The fact is that as application developers, we tend to do a lot in application code that could be better done as part of a query. Sitting down and thinking about how the queries fit into the application is one of the most productive exercises one can do.

The reason is that a lot of data can be processed and filtered in the queries themselves. This allows one to request that the database send back data in the way the application can make best use of it. This can eliminate a lot of application-level code and lead to a shrinking codebase. This in turn allows application-level code to make better use of data returned from queries, which leads to better productivity all around.
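To make this concrete, here is a small sketch, again in Python with SQLite as a stand-in database. The invoice table, its columns, and both function names are hypothetical. The first function pulls every row and filters and totals in application code; the second asks the database for exactly the result the application needs.

```python
import sqlite3

# Assumed schema and sample data, for illustration only (amounts in cents).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE invoice ("
    " id INTEGER PRIMARY KEY,"
    " customer TEXT NOT NULL,"
    " amount INTEGER NOT NULL,"
    " paid INTEGER NOT NULL)"
)
conn.executemany(
    "INSERT INTO invoice (customer, amount, paid) VALUES (?, ?, ?)",
    [
        ("acme", 100, 0),
        ("acme", 250, 0),
        ("acme", 75, 1),
        ("globex", 40, 0),
        ("initech", 500, 1),
    ],
)


def open_balances_in_app(conn, minimum):
    """Fetch every row, then filter and total in application code."""
    totals = {}
    rows = conn.execute("SELECT customer, amount, paid FROM invoice")
    for customer, amount, paid in rows:
        if not paid:
            totals[customer] = totals.get(customer, 0) + amount
    return sorted((c, t) for c, t in totals.items() if t >= minimum)


def open_balances_in_sql(conn, minimum):
    """Let the query filter, total, and order; only the result is transferred."""
    return conn.execute(
        "SELECT customer, SUM(amount) AS total"
        " FROM invoice"
        " WHERE paid = 0"
        " GROUP BY customer"
        " HAVING SUM(amount) >= ?"
        " ORDER BY customer",
        (minimum,),
    ).fetchall()


print(open_balances_in_app(conn, 100))
print(open_balances_in_sql(conn, 100))
```

Both functions return the same rows, but the second one has no loop, no dictionary, and no sorting in application code, and it transfers one row per customer instead of one row per invoice.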

15 comments:

Colin 't Hart, August 19, 2013 at 12:44 AM
AMEN!

SQL is a declarative (4G) language and, as you say, requires a lot of thinking *up front*.
In my experience most developers are either lazy and don't want to think that much and/or want to dive right into coding. Other developers can't or won't think that way and prefer to fall back on their 3G skills.
All of this, combined with the fact that many people still don't understand the power of an RDBMS, means that we SQL developers are rather looked down upon, even regarded as old-fashioned in the face of the latest fads.

The truth is that an app with well-designed views and SQL can be built with a much higher number of "function points per line of code", higher maintainability and fewer defects.

Unknown, August 19, 2013 at 4:35 AM
I seem to remember having this conversation with a boss of mine. We concluded that there was probably a balance to be struck: allow a tool to auto generate the initial queries, but never rerun the tool, hand code everything thereafter. This gets all of your CRUD in place, and then allows you to optimize those queries as you come by later.

Doing it this way means you only ever use the tool once, meaning that cost savings are only realized at the very onset of the project.

"This allows one to request that the database send back data in the way the application can make best use of it."

That *is* the point. I totally agree. I've seen too many data processing routines that spend hours processing, when a little SQL would have let them filter at the database level and skip transferring massive quantities of data.

Sometimes I feel like the industry has moved backwards with people loading entire files from disk and doing the filtering in their app.

Alassane Diakité, August 19, 2013 at 7:30 AM
Hi
developpez.com would like your permission for a French translation of your articles on "Building a solid database". (Sentence obtained via Google Translate!!!)
Thank you

Unknown, August 19, 2013 at 2:54 PM
I think I just want something that allows me to easily map the results returned into objects. I'm otherwise perfectly content with hand writing the query itself.

I will admit, however, that I do not like looking at stringy SQL amongst 3GL code, and would like to have something that moves SQL into its own place.

When working on the database I want to think in the DB's terms (usually relational, SQL); when thinking in my 3GL I usually want to think in terms of objects. I don't need the mapper to make the queries, just to turn the results into objects and hide the queries from where I'm thinking about object logic.

cc young, August 19, 2013 at 9:24 PM
Despite my being a devout Atheist, let me add my "Amen" to the above.

Generated code is one of those things that works 90% of the time (probably), but it's really the other 10% that's interesting.

For automatic features, like change logs, what we really need is a CREATE TABLE trigger to add those.

And, most genned SQL is butt ugly.

Lukas Eder, September 12, 2013 at 4:47 AM
There is a lot of truth in the above, but there are also some merits to ORMs that cannot be denied. As with many things, the discussion boils down to

1. "It depends"
2. "I might personally prefer this over that"

SQL is a very underestimated technology. When I give talks about jOOQ, I'm constantly surprised about how many people are unaware of clauses like:

- HAVING
- CONNECT BY (Oracle)
- GROUPING SETS
- Window functions

... just to name a few. The problem probably isn't the fact that people are lazy or ignorant, or even sickened by SQL. The problem probably is the fact that a lot of middleware software vendors have generated high expectations towards ORM over the last 10 years, which ORM simply never can fulfil. And, there has been no one to help other "3G" developers with alternative, adequate tooling. Consequently, JPA 2.1 still continues specifying tweaks to patch lazy / eager fetching, which don't exactly simplify / beautify things.

I'd like to hear your opinion about www.hibernate-alternative.com, btw.