Search code examples
c#.netvb.netado.netsqldatareader

What 'length' parameter should I pass to SqlDataReader.GetBytes()


I have a SqlDataReader and need to read a varbinary(max) column from it using the SqlDataReader.GetBytes() method. This method populates a byte array and therefore needs to know what length of data to read.

This is where I get confused.. Clearly I want to read all the data that has been returned from the database in this row/column so what 'length' parameter should I pass?

As far as I can see, the SqlDataReader doesn't provide any methods to discover what length of data is available, therefore this method seems fairly awkward to me.

I'm tempted to just pass int.MaxValue here and forget about the issue but something about this doesn't sit right with me.

I am aware that I can instead call

byte[] value = (byte[])dataReader["columnName"];

.. and this seems to completely take care of the length issue internally. However I am working with a set of complicated code generation templates that have been built around the SqlDataReader.GetXXXX() methods. So I am tied into using GetBytes and need to understand its proper usage.


Solution

  • When dealing with varbinary(max), there are two scenarios:

    • the length of the data is moderate
    • the length of the data is big

    GetBytes() is intended for the second scenario, when you are using CommandBehaviour.SequentialAccess to ensure that you are streaming the data, not buffering it. In particular, in this usage you would usually be writing (for example) in a stream, in a loop. For example:

    // moderately sized buffer; 8040 is a SQL Server page, note
    byte[] buffer = new byte[8040]; 
    long offset = 0;
    int read;
    while((read = reader.GetBytes(col, offset, buffer, 0, buffer.Length)) > 0) {
        offset += read;
        destination.Write(buffer, 0, read); // push downstream
    }
    

    However! If you are using moderately sized data, then your original code:

    byte[] data = (byte[])reader[col];
    

    is fine!!. There is nothing wrong with this approach, and in fact the Get* API is broken in a few cases - GetChar() being a notable example (hint: it doesn't work).

    It doesn't matter that you have existing code that uses Get* - in this case, the cast approach is perfectly appropriate.