antlr
Class SatherCodeGenerator
java.lang.Object
|
+--antlr.CodeGenerator
|
+--antlr.SatherCodeGenerator
- public class SatherCodeGenerator
- extends CodeGenerator
Untamed: Generate MY_PARSER.sa, MY_LEXER.sa and MY_PARSER_TOKENTYPES.sa
Field Summary
analyzer, behavior, bitsetsUsed, bitsetTestThreshold, charFormatter, currentOutput, DEBUG_CODE_GENERATOR, DEFAULT_BITSET_TEST_THRESHOLD, DEFAULT_MAKE_SWITCH_THRESHOLD, grammar, makeSwitchThreshold, tabs, TokenTypesFileExt, TokenTypesFileSuffix, tool
Constructor Summary
Method Summary
protected int
addSemPred(String predicate)
Adds a semantic predicate string to the sem pred vector
These strings will be used to build an array of sem pred names
when building a debugging parser.
void
gen()
Enabled: Generate the parser, lexer, treeparser, and token types in Java
void
gen(ActionElement action)
Enabled: Generate code for the given grammar element.
protected void
genBitsets(Vector bitsetList,
int maxVocabulary)
Generate all the bitsets to be used in the parser or lexer
Generate the raw bitset data like "long _tokenSet1_data[] = {...};"
and the BitSet object declarations like "BitSet _tokenSet1 = new BitSet(_tokenSet1_data);"
Note that most languages do not support object initialization inside a
class definition, so other code-generators may have to separate the
bitset declarations from the initializations (e.g., put the initializations
in the generated constructor instead).
private void
genBlockFinish(JavaBlockFinishingInfo howToFinish,
String noViableAction)
Generate the finish of a block, using a combination of the info
returned from genCommonBlock() and the action to perform when
no alts were taken
protected void
genCases(BitSet p)
Generate a series of case statements that implement a BitSet test.
protected void
genHeader()
Generate a header that is common to all Sather files
void
genRule(RuleSymbol s,
boolean startSymbol,
int ruleNum)
Enabled: Gen a named rule block.
protected void
genSemPredMap()
Write an array of Strings which are the semantic predicate
expressions.
void
genTokenStrings()
Enabled: Generate a static array containing the names of the tokens,
indexed by the token type values.
private String
getValueString(int value)
getValueString: get a string representation of a token or char value
void
setupOutput(String className)
Enabled: This method exists so a subclass, namely VAJCodeGenerator,
can open the file in its own evil way.
_print, _printAction, _println, elementsAreRange, extractIdOfAction, extractIdOfAction, extractTypeOfAction, extractTypeOfAction, genTokenInterchange, getBitsetName, lexerRuleName, markBitsetForGen, print, printAction, println, printTabs, processStringForASTConstructor, removeAssignmentFromDeclaration, reverseLexerRuleName, setAnalyzer, setBehavior, setGrammar, setTool
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Field Detail
syntacticPredLevel
protected int syntacticPredLevel
genAST
protected boolean genAST
saveText
protected boolean saveText
labeledElementType
String labeledElementType
labeledElementASTType
String labeledElementASTType
labeledElementInit
String labeledElementInit
commonExtraArgs
String commonExtraArgs
commonExtraParams
String commonExtraParams
commonLocalVars
String commonLocalVars
lt1Value
String lt1Value
exceptionThrown
String exceptionThrown
throwNoViable
String throwNoViable
currentRule
RuleBlock currentRule
currentASTResult
String currentASTResult
treeVariableMap
Hashtable treeVariableMap
astVarNumber
int astVarNumber
NONUNIQUE
protected static final String NONUNIQUE
caseSizeThreshold
public static final int caseSizeThreshold
- Enabled:
semPreds
private Vector semPreds
satherBlockId
private static int satherBlockId
Constructor Detail
SatherCodeGenerator
public SatherCodeGenerator()
- Enabled: Create a Java code-generator using the given Grammar.
The caller must still call setTool, setBehavior, and setAnalyzer
before generating code.
Method Detail
addSemPred
protected int addSemPred(String predicate)
- Adds a semantic predicate string to the sem pred vector
These strings will be used to build an array of sem pred names
when building a debugging parser. This method should only be
called when the debug option is specified
-
exitIfError
public void exitIfError()
- Enabled:
-
gen
public void gen()
- Enabled: Generate the parser, lexer, treeparser, and token types in Java
- Specified by:
gen in class CodeGenerator
-
gen
public void gen(ActionElement action)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
action - The {...} action to generate
gen
public void gen(AlternativeBlock blk)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
blk - The "x|y|z|..." block to generate
gen
public void gen(BlockEndElement end)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
end - The block-end element to generate. Block-end
elements are synthesized by the grammar parser to represent
the end of a block.
gen
public void gen(CharLiteralElement atom)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
atom - The character literal reference to generate
gen
public void gen(CharRangeElement r)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
r - The character-range reference to generate
gen
public void gen(LexerGrammar g)
throws IOException
- Enabled: Generate the lexer Java file
- Specified by:
gen in class CodeGenerator
IOException
gen
public void gen(OneOrMoreBlock blk)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
blk - The (...)+ block to generate
gen
public void gen(ParserGrammar g)
throws IOException
- Enabled: Generate the parser Java file
- Specified by:
gen in class CodeGenerator
IOException
gen
public void gen(RuleRefElement rr)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
rr - The rule-reference to generate
gen
public void gen(StringLiteralElement atom)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
atom - The string-literal reference to generate
gen
public void gen(TokenRangeElement r)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
r - The token-range reference to generate
gen
public void gen(TokenRefElement atom)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
atom - The token-reference to generate
gen
public void gen(TreeElement t)
- Enabled:
- Specified by:
gen in class CodeGenerator
-
gen
public void gen(TreeWalkerGrammar g)
throws IOException
- Enabled: Generate the tree-parser Java file
- Specified by:
gen in class CodeGenerator
IOException
gen
public void gen(WildcardElement wc)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
wc - The wildcard element to generate
gen
public void gen(ZeroOrMoreBlock blk)
- Enabled: Generate code for the given grammar element.
- Specified by:
gen in class CodeGenerator
- Parameters:
blk - The (...)* block to generate
genAlt
protected void genAlt(Alternative alt,
AlternativeBlock blk)
- Generate an alternative.
- Parameters:
alt - The alternative to generateblk - The block to which the alternative belongs
genBitsets
protected void genBitsets(Vector bitsetList,
int maxVocabulary)
- Generate all the bitsets to be used in the parser or lexer
Generate the raw bitset data like "long _tokenSet1_data[] = {...};"
and the BitSet object declarations like "BitSet _tokenSet1 = new BitSet(_tokenSet1_data);"
Note that most languages do not support object initialization inside a
class definition, so other code-generators may have to separate the
bitset declarations from the initializations (e.g., put the initializations
in the generated constructor instead).
- Parameters:
bitsetList - The list of bitsets to generate.maxVocabulary - Ensure that each generated bitset can contain at least this value.
genBlockFinish
private void genBlockFinish(JavaBlockFinishingInfo howToFinish,
String noViableAction)
- Generate the finish of a block, using a combination of the info
returned from genCommonBlock() and the action to perform when
no alts were taken
- Parameters:
howToFinish - The return of genCommonBlock()noViableAction - What to generate when no alt is taken
genBlockPreamble
protected void genBlockPreamble(AlternativeBlock blk)
- Generate the header for a block, which may be a RuleBlock or a
plain AlternativeBLock. This generates any variable declarations,
init-actions, and syntactic-predicate-testing variables.
-
genCases
protected void genCases(BitSet p)
- Generate a series of case statements that implement a BitSet test.
- Parameters:
p - The Bitset for which cases are to be generated
genCommonBlock
public JavaBlockFinishingInfo genCommonBlock(AlternativeBlock blk,
boolean noTestForSingle)
- Enabled: Generate common code for a block of alternatives; return a postscript
that needs to be generated at the end of the block. Other routines
may append else-clauses and such for error checking before the postfix
is generated.
If the grammar is a lexer, then generate alternatives in an order where
alternatives requiring deeper lookahead are generated first, and
EOF in the lookahead set reduces the depth of the lookahead.
- Parameters:
blk - The block to generatenoTestForSingle - If true, then it does not generate a test for a single alternative.
suitableForCaseExpression
private static boolean suitableForCaseExpression(Alternative a)
-
genElementAST
private void genElementAST(AlternativeElement el)
- Generate code to link an element reference into the AST
-
genErrorCatchForElement
private void genErrorCatchForElement(AlternativeElement el)
- Close the try block and generate catch phrases
if the element has a labeled handler in the rule
-
genErrorHandler
private void genErrorHandler(ExceptionSpec ex)
- Generate the catch phrases for a user-specified error handler
-
genErrorTryForElement
private void genErrorTryForElement(AlternativeElement el)
- Generate a try { opening if the element has a labeled handler in the rule
-
genHeader
protected void genHeader()
- Generate a header that is common to all Sather files
-
genLiteralsTest
private void genLiteralsTest()
-
genLiteralsTestForPartialToken
private void genLiteralsTestForPartialToken()
-
genMatch
protected void genMatch(BitSet b)
-
genMatch
protected void genMatch(GrammarAtom atom)
-
genMatchUsingAtomText
protected void genMatchUsingAtomText(GrammarAtom atom)
-
genMatchUsingAtomTokenType
protected void genMatchUsingAtomTokenType(GrammarAtom atom)
-
genNextToken
public void genNextToken()
- Enabled: Generate the nextToken() rule. nextToken() is a synthetic
lexer rule that is the implicit OR of all user-defined lexer
rules.
-
genRule
public void genRule(RuleSymbol s,
boolean startSymbol,
int ruleNum)
- Enabled: Gen a named rule block.
ASTs are generated for each element of an alternative unless
the rule or the alternative have a '!' modifier.
If an alternative defeats the default tree construction, it
must set _AST to the root of the returned AST.
Each alternative that does automatic tree construction, builds
up root and child list pointers in an ASTPair structure.
A rule finishes by setting the returnAST variable from the
ASTPair.
- Parameters:
startSymbol - true if the rule is a start symbol (i.e., not referenced elsewhere)
GenRuleInvocation
private void GenRuleInvocation(RuleRefElement rr)
-
genSemPred
protected void genSemPred(String pred,
int line)
-
genSemPredMap
protected void genSemPredMap()
- Write an array of Strings which are the semantic predicate
expressions. The debugger will reference them by number only
-
genSynPred
protected void genSynPred(SynPredBlock blk,
String lookaheadExpr)
-
genTokenStrings
public void genTokenStrings()
- Enabled: Generate a static array containing the names of the tokens,
indexed by the token type values. This static array is used
to format error messages so that the token identifers or literal
strings are displayed instead of the token numbers.
If a lexical rule has a paraphrase, use it rather than the
token label.
-
genTokenTypes
protected void genTokenTypes(TokenManager tm)
throws IOException
- Generate the token types Java file
IOException
getASTCreateString
public String getASTCreateString(Vector v)
- Enabled: Get a string for an expression to generate creation of an AST subtree.
- Specified by:
getASTCreateString in class CodeGenerator
- Parameters:
v - A Vector of String, where each element is an expression in the target language yielding an AST node.
getASTCreateString
public String getASTCreateString(GrammarAtom atom,
String str)
- Enabled: Get a string for an expression to generate creating of an AST node
- Specified by:
getASTCreateString in class CodeGenerator
- Parameters:
atom - The grammar node for which you are creating the nodestr - The arguments to the AST constructor
getLookaheadTestExpression
protected String getLookaheadTestExpression(Lookahead[] look,
int k)
-
getLookaheadTestExpression
protected String getLookaheadTestExpression(Alternative alt,
int maxDepth)
- Generate a lookahead test expression for an alternate. This
will be a series of tests joined by '&&' and enclosed by '()',
the number of such tests being determined by the depth of the lookahead.
-
getLookaheadTestTerm
protected String getLookaheadTestTerm(int k,
BitSet p)
- Generate a depth==1 lookahead test expression given the BitSet.
This may be one of:
1) a series of 'x==X||' tests
2) a range test using>= && <= where possible, 3) a bitset membership test for complex comparisons
- Parameters:
k - The lookahead levelp - The lookahead set for level k
getRangeExpression
public String getRangeExpression(int k,
int[] elems)
- Enabled: Return an expression for testing a contiguous renage of elements
- Parameters:
k - The lookahead levelelems - The elements representing the set, usually from BitSet.toArray().
- Returns:
- String containing test expression.
getValueString
private String getValueString(int value)
- getValueString: get a string representation of a token or char value
- Parameters:
value - The token or char value
lookaheadIsEmpty
protected boolean lookaheadIsEmpty(Alternative alt,
int maxDepth)
- Is the lookahead for this alt empty?
-
lookaheadString
private String lookaheadString(int k)
-
mangleLiteral
private String mangleLiteral(String s)
- Mangle a string literal into a meaningful token name. This is
only possible for literals that are all characters. The resulting
mangled literal name is literalsPrefix with the text of the literal
appended.
- Returns:
- A string representing the mangled literal, or null if not possible.
mapTreeId
public String mapTreeId(String idParam,
ActionTransInfo transInfo)
- Enabled: Map an identifier to it's corresponding tree-node variable.
This is context-sensitive, depending on the rule and alternative
being generated
- Specified by:
mapTreeId in class CodeGenerator
- Parameters:
idParam - The identifier name to map
- Returns:
- The mapped id (which may be the same as the input), or null if the mapping is invalid due to duplicates
mapTreeVariable
private void mapTreeVariable(AlternativeElement e,
String name)
- Given an element and the name of an associated AST variable,
create a mapping between the element "name" and the variable name.
-
setupGrammarParameters
private void setupGrammarParameters(Grammar g)
-
setupOutput
public void setupOutput(String className)
throws IOException
- Enabled: This method exists so a subclass, namely VAJCodeGenerator,
can open the file in its own evil way. JavaCodeGenerator
simply opens a text file...
IOException
getNextSatherPrefix
private static String getNextSatherPrefix()
-
extractSatherTypeOfAction
protected String extractSatherTypeOfAction(String s)
-
extractSatherIdOfAction
protected String extractSatherIdOfAction(String s,
int line)
-
processActionForTreeSpecifiers
protected String processActionForTreeSpecifiers(String actionStr,
int line,
RuleBlock currentRule,
ActionTransInfo tInfo)
- Lexically process tree-specifiers in the action.
This will replace @id and @(...) with the appropriate
function calls and/or variables.
Override the default implementation inherited from CodeGenerator
in order to instantiate the Sather's ActionLexer rather than Java's
- Overrides:
processActionForTreeSpecifiers in class CodeGenerator
-