Package 'Ramble' reference manual

Title:	Parser Combinator for R
Description:	Parser generator for R using combinatory parsers. It is inspired by combinatory parsers developed in Haskell.
Authors:	Chapman Siu
Maintainer:	Chapman Siu <[email protected]>
License:	MIT + file LICENSE
Version:	0.1.1
Built:	2024-11-04 19:53:39 UTC
Source:	https://github.com/8bit-pixies/ramble

`%alt%` is the infix notation for the `alt` function.

Description

%alt% is the infix notation for the alt function.

Usage

p1 %alt% p2
p1 %alt% p2

Arguments

`p1`	the first parser
`p2`	the second parser

Value

Returns the first parser if it suceeds otherwise the second parser

Examples

(item() %alt% succeed("2")) ("abcdef")
(item() %alt% succeed("2")) ("abcdef")

`%then%` is the infix operator for the then combinator.

Description

%then% is the infix operator for the then combinator.

Usage

p1 %then% p2
p1 %then% p2

Arguments

`p1`	the first parser
`p2`	the second parser

Value

recognises anything that p1 and p2 would if placed in succession.

Examples

(item() %then% succeed("123")) ("abc")
(item() %then% succeed("123")) ("abc")

`%thentree%` is the infix operator for the then combinator, and it is the preferred way to use the `thentree` operator.

Description

%thentree% is the infix operator for the then combinator, and it is the preferred way to use the thentree operator.

Usage

p1 %thentree% p2
p1 %thentree% p2

Arguments

`p1`	the first parser
`p2`	the second parser

Value

recognises anything that p1 and p2 would if placed in succession.

Examples

(item() %thentree% succeed("123")) ("abc")
(item() %thentree% succeed("123")) ("abc")

`%using%` is the infix operator for using

Description

%using% is the infix operator for using

Usage

p %using% f
p %using% f

Arguments

`p`	is the parser to be applied
`f`	is the function to be applied to each result of `p`.

Examples

(item() %using% as.numeric) ("1abc")
(item() %using% as.numeric) ("1abc")

Alpha checks for single alphabet character

Description

Alpha checks for single alphabet character

Usage

Alpha(...)
Alpha(...)

Arguments

...

additional arguments for the primitives to be parsed

Examples

Alpha()("abc")
Alpha()("abc")

AlphaNum checks for a single alphanumeric character

Description

AlphaNum checks for a single alphanumeric character

Usage

AlphaNum(...)
AlphaNum(...)

Arguments

...

additional arguments for the primitives to be parsed

Examples

AlphaNum()("123")
AlphaNum()("abc123")
AlphaNum()("123")
AlphaNum()("abc123")

`alt` combinator is similar to alternation in BNF. the parser `(alt(p1, p2))` recognises anything that `p1` or `p2` would. The approach taken in this parser follows (Fairbairn86), in which either is interpretted in a sequential (or exclusive) manner, returning the result of the first parser to succeed, and failure if neither does.

Description

%alt% is the infix notation for the alt function, and it is the preferred way to use the alt operator.

Usage

alt(p1, p2)
alt(p1, p2)

Arguments

`p1`	the first parser
`p2`	the second parser

Value

Returns the first parser if it suceeds otherwise the second parser

Examples

(item() %alt% succeed("2")) ("abcdef")
(item() %alt% succeed("2")) ("abcdef")

Digit checks for single digit

Description

Digit checks for single digit

Usage

Digit(...)
Digit(...)

Arguments

...

additional arguments for the primitives to be parsed

Examples

Digit()("123")
Digit()("123")

`ident` is a parser which matches zero or more alphanumeric characters.

Description

ident is a parser which matches zero or more alphanumeric characters.

Usage

ident()
ident()

Examples

ident() ("variable1 = 123")
ident() ("variable1 = 123")

`identifier` creates an identifier

Description

identifier creates an identifier

Usage

identifier(...)
identifier(...)

Arguments

...

takes in token primitives

`item` is a parser that consumes the first character of the string and returns the rest. If it cannot consume a single character from the string, it will emit the empty list, indicating the parser has failed.

Description

item is a parser that consumes the first character of the string and returns the rest. If it cannot consume a single character from the string, it will emit the empty list, indicating the parser has failed.

Usage

item(...)
item(...)

Arguments

...

additional arguments for the parser

Examples

item() ("abc")
item() ("")
item() ("abc")
item() ("")

`literal` is a parser for single symbols. It will attempt to match the single symbol with the first character in the string.

Description

literal is a parser for single symbols. It will attempt to match the single symbol with the first character in the string.

Usage

literal(char)
literal(char)

Arguments

char

is the character to be matched

Examples

literal("a") ("abc")
literal("a") ("abc")

Lower checks for single lower case character

Description

Lower checks for single lower case character

Usage

Lower(...)
Lower(...)

Arguments

...

additional arguments for the primitives to be parsed

Examples

Lower() ("abc")
Lower() ("abc")

`many` matches 0 or more of pattern `p`. In BNF notation, repetition occurs often enough to merit its own abbreviation. When zero or more repetitions of a phrase `p` are admissible, we simply write `p*`. The `many` combinator corresponds directly to this operator, and is defined in much the same way.

Description

This implementation of many differs from (Hutton92) due to the nature of R's data structures. Since R does not support the concept of a list of tuples, we must revert to using a list rather than a vector, since all values in an R vector must be the same datatype.

Usage

many(p)
many(p)

Arguments

`p`	is the parser to match 0 or more times.

Examples

Digit <- function(...) {satisfy(function(x) {return(grepl("[0-9]", x))})}
many(Digit()) ("123abc")
many(Digit()) ("abc")
Digit <- function(...) {satisfy(function(x) {return(grepl("[0-9]", x))})}
many(Digit()) ("123abc")
many(Digit()) ("abc")

`maybe` matches 0 or 1 of pattern `p`. In EBNF notation, this corresponds to a question mark ('?').

Description

maybe matches 0 or 1 of pattern p. In EBNF notation, this corresponds to a question mark ('?').

Usage

maybe(p)
maybe(p)

Arguments

`p`	is the parser to be matched 0 or 1 times.

Examples

maybe(Digit())("123abc")
maybe(Digit())("abc123")
maybe(Digit())("123abc")
maybe(Digit())("abc123")

`nat` is a parser which matches one or more numeric characters.

Description

nat is a parser which matches one or more numeric characters.

Usage

nat()
nat()

Examples

nat() ("123 + 456")
nat() ("123 + 456")

`natural` creates a token parser for natural numbers

Description

natural creates a token parser for natural numbers

Usage

natural(...)
natural(...)

Arguments

...

additional arguments for the parser

Ramble is a parser generator using combinatory parsers.

Description

Ramble allows you to write parsers in a functional manner, inspired by Haskell's Parsec library.

`satisfy` is a function which allows us to make parsers that recognise single symbols.

Description

satisfy is a function which allows us to make parsers that recognise single symbols.

Usage

satisfy(p)
satisfy(p)

Arguments

`p`	is the predicate to determine if the arbitrary symbol is a member.

`some` matches 1 or more of pattern `p`. in BNF notation, repetition occurs often enough to merit its own abbreviation. When zero or more repetitions of a phrase `p` are admissible, we simply write `p+`. The `some` combinator corresponds directly to this operator, and is defined in much the same way.

Description

some matches 1 or more of pattern p. in BNF notation, repetition occurs often enough to merit its own abbreviation. When zero or more repetitions of a phrase p are admissible, we simply write p+. The some combinator corresponds directly to this operator, and is defined in much the same way.

Usage

some(p)
some(p)

Arguments

`p`	is the parser to match 1 or more times.

Examples

Digit <- function(...) {satisfy(function(x) {return(grepl("[0-9]", x))})}
some(Digit()) ("123abc")
Digit <- function(...) {satisfy(function(x) {return(grepl("[0-9]", x))})}
some(Digit()) ("123abc")

`space` matches zero or more space characters.

Description

space matches zero or more space characters.

Usage

space()
space()

Examples

space() ("  abc")
space() ("  abc")

SpaceCheck checks for a single space character

Description

SpaceCheck checks for a single space character

Usage

SpaceCheck(...)
SpaceCheck(...)

Arguments

...

additional arguments for the primitives to be parsed

Examples

SpaceCheck()(" 123")
SpaceCheck()(" 123")

`String` is a combinator which allows us to build parsers which recognise strings of symbols, rather than just single symbols

Description

String is a combinator which allows us to build parsers which recognise strings of symbols, rather than just single symbols

Usage

String(string)
String(string)

Arguments

string

is the string to be matched

Examples

String("123")("123 abc")
String("123")("123 abc")

`succeed` is based on the empty string symbol in the BNF notation The `succeed` parser always succeeds, without actually consuming any input string. Since the outcome of succeed does not depend on its input, its result value must be pre-detemined, so it is included as an extra parameter.

Description

succeed is based on the empty string symbol in the BNF notation The succeed parser always succeeds, without actually consuming any input string. Since the outcome of succeed does not depend on its input, its result value must be pre-detemined, so it is included as an extra parameter.

Usage

succeed(string)
succeed(string)

Arguments

string

the result value of succeed parser

Examples

succeed("1") ("abc")
succeed("1") ("abc")

`symbol` creates a token for a symbol

Description

symbol creates a token for a symbol

Usage

symbol(xs)
symbol(xs)

Arguments

`xs`	takes in a string to create a token

Examples

symbol("[") ("  [123]")
symbol("[") ("  [123]")

`then` combinator corresponds to sequencing in BNF. The parser `(then(p1, p2))` recognises anything that `p1` and `p2` would if placed in succession.

Description

%then% is the infix operator for the then combinator, and it is the preferred way to use the then operator.

Usage

then(p1, p2)
then(p1, p2)

Arguments

`p1`	the first parser
`p2`	the second parser

Value

recognises anything that p1 and p2 would if placed in succession.

Examples

(item() %then% succeed("123")) ("abc")
(item() %then% succeed("123")) ("abc")

`thentree` keeps the full tree representation of the results of parsing. Otherwise, it is identical to `then`.

Description

thentree keeps the full tree representation of the results of parsing. Otherwise, it is identical to then.

Usage

thentree(p1, p2)
thentree(p1, p2)

Arguments

`p1`	the first parser
`p2`	the second parser

Value

recognises anything that p1 and p2 would if placed in succession.

Examples

(item() %thentree% succeed("123")) ("abc")

(item() %thentree% succeed("123")) ("abc")

`token` is a new primitive that ignores any space before and after applying a parser to a token.

Description

token is a new primitive that ignores any space before and after applying a parser to a token.

Usage

token(p)
token(p)

Arguments

`p`	is the parser to have spaces stripped.

Examples

token(ident()) ("   variable1   ")
token(ident()) ("   variable1   ")

Unlist is the same as unlist, but doesn't recurse all the way to preserve the type. This function is not well optimised.

Description

Unlist is the same as unlist, but doesn't recurse all the way to preserve the type. This function is not well optimised.

Usage

Unlist(obj)
Unlist(obj)

Arguments

obj

is a list to be flatten

Upper checks for a single upper case character

Description

Upper checks for a single upper case character

Usage

Upper(...)
Upper(...)

Arguments

...

additional arguments for the primitives to be parsed

Examples

Upper()("Abc")
Upper()("Abc")

`using` combinator allows us to manipulate results from a parser, for example building a parse tree. The parser `(p %using% f)` has the same behaviour as the parser `p`, except that the function `f` is applied to each of its result values.

Description

%using% is the infix operator for using, and it is the preferred way to use the using operator.

Usage

using(p, f)
using(p, f)

Arguments

`p`	is the parser to be applied
`f`	is the function to be applied to each result of `p`.

Value

The parser (p %using% f) has the same behaviour as the parser p, except that the function f is applied to each of its result values.

Examples

(item() %using% as.numeric) ("1abc")
(item() %using% as.numeric) ("1abc")

Package 'Ramble'

Help Index

%alt% is the infix notation for the alt function.

Description

Usage

Arguments

Value

Examples

%then% is the infix operator for the then combinator.

Description

Usage

Arguments

Value

Examples

%thentree% is the infix operator for the then combinator, and it is the preferred way to use the thentree operator.

Description

Usage

Arguments

Value

See Also

Examples

%using% is the infix operator for using

Description

Usage

Arguments

Examples

Alpha checks for single alphabet character

Description

Usage

Arguments

See Also

Examples

AlphaNum checks for a single alphanumeric character

Description

Usage

Arguments

See Also

Examples

Description

Usage

Arguments

Value

See Also

Examples

Digit checks for single digit

Description

Usage

Arguments

See Also

Examples

ident is a parser which matches zero or more alphanumeric characters.

Description

Usage

See Also

Examples

identifier creates an identifier

Description

Usage

Arguments

See Also

item is a parser that consumes the first character of the string and returns the rest. If it cannot consume a single character from the string, it will emit the empty list, indicating the parser has failed.

Description

Usage

Arguments

Examples

literal is a parser for single symbols. It will attempt to match the single symbol with the first character in the string.

Description

Usage

Arguments

Examples

Lower checks for single lower case character

Description

Usage

Arguments

See Also

Examples

many matches 0 or more of pattern p. In BNF notation, repetition occurs often enough to merit its own abbreviation. When zero or more repetitions of a phrase p are admissible, we simply write p*. The many combinator corresponds directly to this operator, and is defined in much the same way.

Description

Usage

Arguments

`%alt%` is the infix notation for the `alt` function.

`%then%` is the infix operator for the then combinator.

`%thentree%` is the infix operator for the then combinator, and it is the preferred way to use the `thentree` operator.

`%using%` is the infix operator for using

`ident` is a parser which matches zero or more alphanumeric characters.

`identifier` creates an identifier

`item` is a parser that consumes the first character of the string and returns the rest. If it cannot consume a single character from the string, it will emit the empty list, indicating the parser has failed.

`literal` is a parser for single symbols. It will attempt to match the single symbol with the first character in the string.

`maybe` matches 0 or 1 of pattern `p`. In EBNF notation, this corresponds to a question mark ('?').

`nat` is a parser which matches one or more numeric characters.

`natural` creates a token parser for natural numbers

`satisfy` is a function which allows us to make parsers that recognise single symbols.

`space` matches zero or more space characters.

`String` is a combinator which allows us to build parsers which recognise strings of symbols, rather than just single symbols

`succeed` is based on the empty string symbol in the BNF notation The `succeed` parser always succeeds, without actually consuming any input string. Since the outcome of succeed does not depend on its input, its result value must be pre-detemined, so it is included as an extra parameter.

`symbol` creates a token for a symbol

`then` combinator corresponds to sequencing in BNF. The parser `(then(p1, p2))` recognises anything that `p1` and `p2` would if placed in succession.

`thentree` keeps the full tree representation of the results of parsing. Otherwise, it is identical to `then`.