Search code examples
javamysqlarraysblobvarchar

BLOB vs. VARCHAR for storing arrays in a MySQL table


I've got a design decision to make and am looking for some best practice advice. I have a java program which needs to store a large number (few hundred a day) of floating point arrays in a MySQL database. The data is a fixed length Double array of length 300. I can see three reasonable options:

  1. Store the data as a BLOB.
  2. Serialize the data and store it as a VARCHAR.
  3. Write the data to disk as a binary file and store a reference to it instead.

I should also mention that this data will be read from and updated frequently.

I want to use a BLOB since that is what I have done in the past and it seems like the most efficient method (e.g., maintains fixed width & no need to convert to a comma separated string). However my coworker is insisting that we should serialize and use varchar for reasons which seem mostly dogmatic.

If one of these methods is better than the other, are the reasons Java or MySQL specific?


Solution

  • Store as a BLOB like so (see code example below). I think this is probably better than using java serialization since java's builtin serialization will need 2427 bytes, and non-java applications will have a harder time dealing with the data. That is, should there ever be any non-java applications querying the database in the future.... if not then the builtin serialization is a few less lines.

    public static void storeInDB() throws IOException, SQLException {
    
        double[] dubs = new double[300];
    
        ByteArrayOutputStream bout = new ByteArrayOutputStream();
        DataOutputStream dout = new DataOutputStream(bout);
        for (double d : dubs) {
            dout.writeDouble(d);
        }
        dout.close();
        byte[] asBytes = bout.toByteArray();
    
        PreparedStatement stmt = null;  // however we normally get this...
        stmt.setBytes(1, asBytes);
    
    }
    
    public static double[] readFromDB() throws IOException, SQLException {
    
        ResultSet rs = null;  // however we normally get this...
        while (rs.next()) {
            double[] dubs = new double[300];
            byte[] asBytes = rs.getBytes("myDoubles");
            ByteArrayInputStream bin = new ByteArrayInputStream(asBytes);
            DataInputStream din = new DataInputStream(bin);
            for (int i = 0; i < dubs.length; i++) {
                dubs[i] = din.readDouble();
            }
            return dubs;
        }
    
    }
    

    Edit: I'd hoped to use BINARY(2400), but MySQL says:

    mysql> create table t (a binary(2400)) ;
    ERROR 1074 (42000): Column length too big for column 'a' (max = 255);
    use BLOB or TEXT instead