SQL Server – Page 4 – R.J. Dudley

SQL Prompt Free until 9/1/2006!

At the last BADNUG meeting, someone mentioned
they wished there was Intellisense for SQL Server. I said there was such a
product; Red Gate recently purchased it, and is
now giving it away until Sep. 1, 2006!

Also included in the package is a 14-day trial of Red Gate’s Dependency
Tracker, which is a wickedly-cool database diagrammer.

More information and download link at http://www.red-gate.com/products/sql_prompt/index.htm.

Tracing Stored Procedures

One of the great things about stored procedures is that you
can put a whole bunch of different commands into one procedure, and call
everything with a single command. Gnarly! However, if the procedureruns slowly, it’s difficult to determine
which command(s) are the problem. Bogus!

One trick I use is to sprinkle little logging steps
throughout my stored procedure, and try to narrow down which command is causing
the problem. These steps just log a step name and a timestamp into a
table.

My table has the following structure:

CREATE TABLE [tblCommands] (
[fldID]
[int] IDENTITY (1, 1) NOT NULL ,
[fldCommand] [nvarchar] (2000)
COLLATE SQL_Latin1_General_CP1_CI_AS NULL ,
[fldDate] [datetime] NOT
NULL CONSTRAINT [DF_tblCommands_fldDate] DEFAULT (getdate())
) ON
[PRIMARY]
GO

fldCommand is either a good name of the setp being logged,
or something descriptive about what’s happening in the procedure. Just so
long as I can find it again in the sproc.

Then, I add commands such as

insert into tblCommands(fldCommand, fldDate)
values (‘Final selection’,getdate())

— do my selection here

insert into tblCommands(fldCommand, fldDate)
values (‘End’,getdate())

Reviewing the data from
tblCommands:

fldID
fldCommand
fldDate
———– —————-
———————–
31
Start
2006-05-11 13:18:19.943
… (snipped for
brevity)
39          Final
selection    2006-05-11 13:18:24.240
40
End
2006-05-11 13:19:56.677

We can see that the entire procedure runs in roughly 1:40,
with approx. 1:30 of that wrapped up in the final selection step alone. By
focusing on only the final step, I was able to optimize the query so that the
entire procedure runs in 10 seconds.

I realize that with the naming convention I’ve used here,
I’m likely to raise the ire of naming zealots. I like appending a prefix
to column names because that means I can use a reserved word such as ‘Date’ as a
column name. If you don’t like it, you’re fere to use a different
convention in your own sprocs.

Eric brings up a good
point in his comment below about using the Query Analyzer (part of the SQL
Server 2000 tools). I have used the Query Analyzer extensively, and one
place it falls short is with dynamic SQL statements in your queries. If
you have properly formatted dynamic sql (see http://www.rjdudley.com/blog/Is+Dynamic+SQL+In+Your+Stored+Procedures+Vulnerable+To+SQL+Injection.aspx),
and especially more than one dynamic SQL statement, the Query Analyzer can’t
analyze the sproc. Adding sinks like I have above can help debug an entire
sproc, especially those with dynamic SQL.

Monitoring DTS Steps

Part of my work involves a SQL Server 2000 data warehouse,
updated nightly with extracts from an ERP system. I knew that the DTSs
tended to run long, and I wanted to get a better idea of exactly how much data
we were moving around.

First thing I did was add a table with the following
schema:

CREATE TABLE [tblDtsLog]
(
[fldEntry] [int] IDENTITY (1, 1) NOT NULL ,
[fldStep]
[int] NOT NULL ,
[fldStepType] [varchar] (5) COLLATE
SQL_Latin1_General_CP1_CI_AS NOT NULL ,
[fldStepName] [varchar] (50)
COLLATE SQL_Latin1_General_CP1_CI_AS NOT NULL ,
[fldTimeStamp]
[datetime] NOT NULL ,
[fldNotes] [varchar] (255) COLLATE
SQL_Latin1_General_CP1_CI_AS NULL ,
[fldPackage] [varchar] (50)
COLLATE SQL_Latin1_General_CP1_CI_AS NULL
) ON
[PRIMARY]

This is a pretty general structure, and allows me to
add different logging and debugging steps to the DTS. fldEntry is simply
an incrementing value to help sorting what happened when. fldStep refers
to the stepf of the DTS package, fldStepType is a short description of the DTS
step. I use fldStep because someone always asks me what that step is doing
when we’re looking at the numbers, and it’s easier to log it than open the DTS
and find it. fldStepName is there because I inherited almost all of these
DTSs, and the names don’t always describe what’s going on. fldTimeStamp is
the execution time of the step, fldNotes are some information I want to include
about the step (usually just a rowcount), and fldPackage is the name of the DTS
package.

One of the early steps for most DTSs is the extract
from the ERP system to an intermediate table. Depending on the package,
the data in the intermediate table used for comparisons or modified before being
migrated to the production tables. By adding a SQL task immediately
after the extract, I could get an exact idea of how many data rows we just
moved:

select * from IntermediateTable
insert
into tblDtsLog(fldStep, fldStepType, fldStepName, fldTimeStamp,fldNotes,
fldPackage) values (1,’Begin’, ‘Extract Step 1’,getdate(),@@rowcount, ‘Some
DTS’)

This simly adds one entry to the above table with a
count of how many rows exist in the intermediate table. For other steps, I use the Begin and End flags in conjunction with
the getdate() to see how much time elapsed for the processing of a particular
matching or modification step. By sprinkling these liberally through my
DTSs (some packages have 15-20 of these logging sinks in them), I have excellent
insight into what is running slow, and then we can examine each step and try to
optimize it.

Recently, we used the results of this monitoring to
make a small tweak to one step of one DTS, which resulted in an overall gain of
2 hours of processing time in the overall update!

Copying DTS Packages to a Different Server in SQL Server 2000

Freaky–we were just talking about this yesterday afternoon.

The latest SQLServerCentral.com
update came out today, and the lead story is an update to an older article
titled “Copying
DTS Packages to a Different Server“. The article has some code samples
to show you how you can copy DTS packages from one SQL Server 2000 to
another.

If you’re lazy, don’t bother with the code. Instead, there’s a great
GUI tool called DTSBackup
2000 from SQLDTS.com. Using
DTSBackup 2000 couldn’t be easier–choose your origin server (supply
credentials if necessary), choose the DTSs you want to copy, choose your
destination server (supply credentials if necessary), add in the owner pwd if
necessary, and click Transfer. Within minutes, your packages are
transferred. It’s sweet, and works like a charm.

I use this to copy DTSs from my dev/test server to the prod server, and back
again if necessary. Layout in the visual designer gets a little goofy, but
that’s the only downside.

Is Dynamic SQL in Your Stored Procedures Vulnerable to SQL Injection?

We all should be familiar with the fact that concatenating user input directly into SQL statements is an open invitation to an SQL Injection attack. Code such as
MySql = “Select * from Orders where Customer ID='” & txtCustomerId & “‘”
should be avoided. If you need some more background information on SQL Injection attacks, I am building a reference at http://www.rjdudley.com/blog/CrossSiteScriptingXSSAttacksSQLInjectionAndASPNET.aspx. This reference will be updated as time goes on–there are a few good references now, and I’ll post update notices to the security section of this blog.

The recommended practice for avoiding SQL Injection attacks is to use parameterized queries or stored procedures (sprocs), where user input is passed as parameters. Since information in parameters is not treated as executable code, any SQL code conatined in the user input is rendered harmless. Or is it? This depends on what you do with that input inside of your sproc.

One of the common functions on a web site is querying a data store. In advanced searches (those with more than a single input), it would be infeasible to create and mainatin an sproc for every combination of search critera. Instead, one practice is to create an sproc that dynamically creates the SELECT statement based on the parameters passed to it. Typically, there is an input parameter for each input on the search form, which is rendered optional by adding “=NULL” after the parameter declaration (e.g., @orderId int=NULL). Then, the sproc uses a series of statements such as

IF @orderId IS NOT NULL
select @sql = @sql + ‘ AND order_id=’ + @orderId

to generate the complete SQL statement. At the end of the sproc, the EXECUTE statement is used to query the database using the dynamically generated SQL statement.

I remember what a revolutionary concept dynamic SQL in an sproc was for me when I was learning to write SQL. It opened up a whole new way of writing SQL code and handling advanced searches on my websites. But did you catch the security problem in the previous SQL statement? I didn’t at first, and in fact, I’ve been making this same security mistake for some time now. It wasn’t until I finally listened to Kim Tripp on DotNetRocks that I realized the problem (download the show from http://www.dotnetrocks.com/default.aspx?showID=75), and fortunately I only have a few sprocs to rewrite and fix this problem.

Look carefully at the statement again. It looks like the parameter is being used in the SQL statement, but in reality, the parameter’s value is being concatenated to the SQL statement. The technique demonstrated above is no better than the technque we dismissed in the first paragraph.

After listening to Kim’s show, I did some digging around, and found an excellent reference on how to handle dynamic SQL in search queries at http://www.sommarskog.se/dyn-search.html. In this article, Microsoft Sql Server MVP Erland Sommarskog details ways to use dynamic and static SQL to perform searches that have a number of possible combinations of inputs.

As Erland shows us, the correct way to use dynamic SQL in the situation I presented above is to concatenate another parameter into the SQL statement, as so:

IF @orderId IS NOT NULL
select @sql = @sql + ‘ AND order_id=@xorderId’

We then create a parameter list of these second parameters, as so:

SELECT @paramList = ‘@xorderId’

To finally execute the query, we execute a system sproc named sp_executesql. As Erland states:

sp_executesql is a system procedure with a very special parameter list. The first parameter is a parameterized SQL statement. The second parameter is a parameter-list declaration, very similar to the parameter list to a stored procedure. And the remaining parameters are simply the parameters in that parameter-list parameter.

Our final statement would end up looking like:

EXECUTE sp_executesql @sql, @paramList, @orderId

And with this technique, our query is safe from malicious user input. This whole process is outlined in detail in Erland’s article.

Since writing sprocs as outlined in Erland’s article can be tedious, I created a CodeSmith template that will do the work for you. You only need to input the table you wish to query, and CodeSmith will generate a complete sproc for you. You can then edit the sproc down, since it will include every column in the table. You can find the template at http://www.ericjsmith.net/codesmith/forum/default.aspx?f=9&m=4346.