Fortran 2003 variant of ISO_VARYING

Ian Harvey

unread,

Mar 31, 2016, 6:39:53 PM3/31/16

to

Inspired by some of the recent threads about allocatable deferred length
character, I have updated the implementation of ISO_VARYING_STRING that
I occasionally use to take advantage of more of Fortran 2003.

The sources is at http://www.megms.com.au/download/aniso_varying_string.f90

Some documentation is at http://www.megms.com.au/aniso_varying_string.htm

This implementation was originally the one by Rich Townsend, using a
deferred size allocatable character array to store the character data of
the string, but I have changed this to an allocatable deferred length
character scalar (so all bugs are hence my fault).

This change simplifies many of the operations associated with the type.
The allocatable deferred length scalar component is also publicly
accessible, which permits things like substring operations directly on
that component and use of the structure constructor for the type.

But here is where the real fun begins... I have also provided defined
input and output for the type.

Unformatted input/output simply reads/writes the string length as a
default integer followed by the character data of the string.

Formatted defined input and output for list directed and namelist
formatting are implemented to mirror that of list directed and namelist
formatting of the intrinsic character type, including the requirement
that namelist input be delimited. Explicitly formatted input, without
the optional character literal in the format specifier, also behaves
similarly to list directed input, while explicitly formatted output (the
character literal and v_list array must not be present in the specifier)
behaves just like the A edit descriptor.

For explicitly formatted input I have also allowed the behaviour to be
modified by the contents of the character literal following the DT edit
descriptor (the v_list array is not used and must not be present). In
the face of the practically infinite range of possibilities for how
people might want to read character data from a file into a string, I
found the "design" of these modifiers to be a bit arbitrary. Perhaps
readers have some better suggestions.

Some examples of the latter, assuming the variables starting with `vs`
are declared of type `varying_string`, and with the example record
delimited with backticks:

- A CSV-like record with a known number of fields, such as:

`noblanks, leading blanks,trailing blanks ,"delimited,value"`

could be read, perhaps with use of Fortran 2008's unlimited
format count:

READ (unit, "(*(DT'comma,noskipblank',:,1X))") &
vs1, vs2, vs3, vs4

and would result in:

vs1 = 'noblanks'
vs2 = ' leading blanks'
vs3 = 'trailing blanks '
vs4 = 'delimited,value'

where comma specifies that, in the absence of delimited
input, input for a value is terminated by a comma and
`noskipblank` specifies that leading blanks of a value
are significant.

- An entire line, such as:

` "An entire "" line" `

could be read:

READ (unit, "(DT'eor,nodelimited')") vs

and would result in:

vs = '"An entire "" line" '

where `eor` specifies that, in the absence of delimited
input, input for a value is terminated by the end of
record, and `nodelimited` suppresses consideration of
delimited input. Without specification to the contrary,
leading blanks in the value are not considered
significant.

If nodelimited was dropped from the character literal in
the format specification, the result would be:

vs = 'An entire " line'

i.e. the input is take to be the delimited value only,
with the usual character literal conventions for how
doubled delimiters are treated as a single delimiter
inside the value.

- Classic fixed width input is supported using the `fixed`
modifier. The letter characters out of the following:

`123abcdef456`

could be read:

READ (unit, "(3X,DT'fixed(6)')") vs

~~~

I am not sure whether my implementation of formatted defined
input/output is correct with respect to questions like "when
should/shall (answers might differ between those two) the defined input
procedure signal an end-of-record condition". For fixed width input
this might be obvious (the combination of format spec and item
*required* more characters, but there were no more in the record, so
defined input complains), but for varying length data, where end of
record might well be a valid indication of the end of the value, it is
not so clear. I've tended to go with the option of suppressing
end-of-record conditions if I got any data I considered useful, but I am
not sure if that is the right thing to do. I was also a bit uncertain
about how defined input was supposed to function in the context of list
directed and namelist output - where does responsibility sit for things
like skipping whitespace, separator characters, repeat specifications, etc.

Unfortunately I ran into some compiler issues here (using ifort 16.0.2)
that mean I cannot fully test/use this in anger yet (including for the
examples above), but I'm interested in any comments in the meantime.

Stefano Zaghi

unread,

Apr 1, 2016, 12:22:58 AM4/1/16

to

Dear Ian,

Thank you very much for sharing your work, it is very interedting, I will study it. A note: from a very superficial reading of your site I do not see the license of your code, do I miss this information? Can you point me to the license statement? Maybe it is into the code (I am using a smarthphone now and I cannot open your code...).

My best regards.

Ian Harvey

unread,

Apr 1, 2016, 1:57:33 AM4/1/16

to

The copyright/licence bits for Rich Townsend's original source are in
the comments at the top of the module source.

You can regard my changes to that module as either being public domain;
or involving insufficient creativity or intellectual effort to be
copyrightable in the first place; or available under the same licence as
for the original source, at your option - I really don't care.

Jos Bergervoet

unread,

Apr 1, 2016, 3:03:30 AM4/1/16

to

On 4/1/2016 12:39 AM, Ian Harvey wrote:
> Inspired by some of the recent threads about allocatable deferred length
> character, I have updated the implementation of ISO_VARYING_STRING that
> I occasionally use to take advantage of more of Fortran 2003.
>
> The sources is at http://www.megms.com.au/download/aniso_varying_string.f90
>
> Some documentation is at http://www.megms.com.au/aniso_varying_string.htm
>
> This implementation was originally the one by Rich Townsend, using a
> deferred size allocatable character array to store the character data of
> the string, but I have changed this to an allocatable deferred length
> character scalar

Thank you, Ian!

> (so all bugs are hence my fault).

They just make it more interesting of course.

--
Jos

Wolfgang Kilian

unread,

Apr 1, 2016, 3:53:15 AM4/1/16

to

Ian,

thanks a lot, that is definitely something that we'll have a look at.
(We're using ISO_VARYING_STRING throughout our code). The defined I/O
bit sounds particularly promising. Also, direct accessibility should
simplify interoperation with libraries that expect/provide C strings,
which is not straightforward with the current implementation. From your
description I guess that the new implementation requires a very recent
compiler, so that would exclude it for production use ...

(Compiler issues: Even the original Townsend implementation doesn't work
with all current compilers.)

-- Wolfgang

Clive Page

unread,

Apr 1, 2016, 5:04:43 AM4/1/16

to

Ian

Thank you for making that available - it looks like an exellent bit of
work. The only problem is, as you note, that not many compilers are
yet up-to-date enough to compile it, and that includes gfortran which is
one of the several which do not support defined I/O, or at least not yet
(one can hope...).

--
Clive Page

Stefano Zaghi

unread,

May 16, 2016, 4:33:19 AM5/16/16

to

Dear Ian,

I am not really sure, but it seems that at line 852 of your tests suite "aniso_varying_string-tests.f90" there is a typo:

IF (vs /= 'No quotes') THEN
stat = 1
RETURN
END IF

I think this should be `vs /= '"Has quotes"'`, but maybe I am wrong.

Thank you for your great work!

My best regards.

Ian Harvey

unread,

May 16, 2016, 7:11:10 AM5/16/16

to

On 2016-05-16 6:33 PM, Stefano Zaghi wrote:
> Dear Ian,
>
> I am not really sure, but it seems that at line 852 of your tests suite "aniso_varying_string-tests.f90" there is a typo:
>
> IF (vs /= 'No quotes') THEN
> stat = 1
> RETURN
> END IF
>
> I think this should be `vs /= '"Has quotes"'`, but maybe I am wrong.

Yes. Thanks.

Fortran 2003 variant of ISO_VARYING_STRING

Ian Harvey

Stefano Zaghi

Ian Harvey

Jos Bergervoet

Wolfgang Kilian

Clive Page

Stefano Zaghi

Ian Harvey