[m-rev.] diff: string splitting routines to string.m
Ondrej Bojar
obo at cuni.cz
Fri Feb 2 12:39:11 AEDT 2007
Ralph Becket wrote:
> In this case I'm just looking for a convincing case *for*
> split_at_string, but I'm not arguing *against* it.
It's a generalization of split_at_char. To be honest, I came across it
in one project only so far, but it simplifies nicely situations where
you want some hierarchy of delimiters:
sentence_id ||| I|pronoun am|verb sleepy|adj ||| scores
Usually, one would use 'tab' to delimit main columns and 'space' to
delimit words/subfields within fields. Some people are afraid of
whitespace delimiters and prefer to use a printable character. One needs
to escape the character (the whitespace too, but it's not that common
that your data actually contains whitespace). So for each level of
delimiting, you need another character and another escape sequence. If
your fields are never blank, you can however use just different number
of copies of the single delimiter char, to mark different levels of
segments. And this is where split_at_string is useful.
Probably not very convincing, though....
O.
--
Ondrej Bojar (mailto:obo at cuni.cz)
http://www.cuni.cz/~obo
--------------------------------------------------------------------------
mercury-reviews mailing list
Post messages to: mercury-reviews at csse.unimelb.edu.au
Administrative Queries: owner-mercury-reviews at csse.unimelb.edu.au
Subscriptions: mercury-reviews-request at csse.unimelb.edu.au
--------------------------------------------------------------------------
More information about the reviews
mailing list