by Leigh Roberts >> Tue, 17 Aug 1999 0:25:01 GMT
I have a need to rework the convertCsv method below that takes a line from a comma seperated variable file and converts it to an array of strings. I didn't write the code but I can see where I can make some improvements. The method is used in parsing files with 10's of thousands of lines, so I want it to run as fast as possible.
It also handles the situation where commas in the text are surrrounded by quotes. I think that this very rare situation could be dealt with by finding out if there are any quotes at all in the line and then performing the more expensive checks to weed the comma out.
Any comments on the relative speed of techniques would be appreciated.
Thanks
Leigh
convertCsv(): StringArray updating;
vars
i, j, pos :Integer;
len :Integer;
sa :StringArray;begin
create sa transient;
pos := 1;
i := 1;
while pos <= self.length do
// Locate next comma
len := self.pos (",",pos);
len := len - pos;
if self.pos (",",pos) = 0 and pos <= self.length then
len := self.length - pos + 1;
endif;
// Put string into array
sa := self [pos:len];
// If comma inside quotes, concatenate with rest of
// string
while sa [1] = '"' and sa [sa .length] <> '"' do
pos := pos + len + 1;
len := self.pos(",",pos);
len := len - pos;
sa := sa & "," & self [pos:len];
endwhile;
// Remove quotes from the string
if sa [1] = '"' and sa [sa .length] = '"' then
sa := sa [i] [2:sa [i].length-2];
endif;
sa [i] := sa [i].trimRight;
// Increment position to just after the last comma
pos := pos + len + 1;
i := i + 1;
endwhile;
// sa contains full line, and can now be used for display, etc
return sa;
end;