2022, IRJET
The process of turning a string of characters into a string of tokens is known as lexical analysis, commonly referred to as lexing or tokenization. These tokens may be keywords, identifiers, constants, operators, or other language-specific symbols. The word "lexical" derives from "lexeme", the term for the unit of text matched as a token. Lexical analysis usually involves reading the input character by character, grouping characters into tokens, and passing these tokens to a parser or other program for further processing. It is often the first step in compiling or interpreting a program, and it is also used in natural language processing, information retrieval, and other fields where the elements of a body of text must be identified and classified. In general, lexical analysis breaks a stream of text into a sequence of tokens, which other programs can then process and analyze further. It is an important step in the compilation and interpretation of programming languages, as well as in the processing of natural language.
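To make the procedure concrete, the following Python sketch reads the input character by character and groups characters into keyword, identifier, constant, and operator tokens. The token categories and the tiny keyword set are illustrative assumptions, not anything prescribed by the paper.

```python
# Minimal character-by-character lexer sketch. The keyword and operator
# sets below are illustrative assumptions.
KEYWORDS = {"if", "else", "while", "return"}
OPERATORS = set("+-*/=<>")

def tokenize(source):
    tokens, i = [], 0
    while i < len(source):
        ch = source[i]
        if ch.isspace():                       # whitespace separates tokens
            i += 1
        elif ch.isalpha() or ch == "_":        # group letters into a word
            start = i
            while i < len(source) and (source[i].isalnum() or source[i] == "_"):
                i += 1
            word = source[start:i]
            tokens.append(("KEYWORD" if word in KEYWORDS else "IDENTIFIER", word))
        elif ch.isdigit():                     # group digits into a constant
            start = i
            while i < len(source) and source[i].isdigit():
                i += 1
            tokens.append(("CONSTANT", source[start:i]))
        elif ch in OPERATORS:
            tokens.append(("OPERATOR", ch))
            i += 1
        else:
            raise ValueError(f"unexpected character {ch!r}")
    return tokens

print(tokenize("while x < 10 return x + 1"))
# [('KEYWORD', 'while'), ('IDENTIFIER', 'x'), ('OPERATOR', '<'), ('CONSTANT', '10'), ...]
```

The resulting token stream is exactly what a parser would consume in the next phase.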
International Journal of Applied Engineering and Management Letters (IJAEML), 2020
The term "lexical" in the lexical analysis phase of compilation is derived from the word "lexeme", which is the basic conceptual unit of linguistic morphological study. In computer science, lexical analysis, also referred to as lexing, scanning, or tokenization, is the process of transforming the string of characters in a source program into a stream of tokens, where a token is a string with a designated and identified meaning. It is the first phase of the two-step compilation processing model known as the analysis stage, which the compiler uses to understand the input source program. The objective is to convert character streams into words and recognize each token's type. The generated stream of tokens is then used by the parser to determine the syntax of the source program. The compilation-phase program that performs lexical analysis is termed a lexical analyzer, lexer, scanner, or tokenizer. Lexical analyzers are used in various computer science applications, such as word processing, information retrieval systems, pattern recognition systems, and language-processing systems; however, the scope of our review study is language processing. Various tools are used for the automatic generation of tokenizers and are better suited to sequential execution of the process. Recent advances in multi-core architectures have created a need to re-engineer the compilation process to exploit them: by parallelizing the recognition of tokens across multiple cores, the cores can be used optimally, thus reducing compilation time. To attain parallelism in tokenization on multi-core machines, the lexical analyzer phase of compilation needs to be restructured to accommodate the multi-core architecture, exploiting the language constructs that can run in parallel and the concept of processor affinity. This paper provides a systematic analysis of the literature to discuss emerging approaches and issues related to lexical analyzer implementation and the adoption of improved methodologies. This has been achieved by reviewing 30 published articles on the implementation of lexical analyzers. The results of this review indicate various techniques, recent developments, and current approaches for implementing auto-generated and hand-crafted scanners. Based on the findings, we assess the efficacy of lexical analyzer implementation techniques from the results discussed in the selected studies, and the paper identifies future research challenges and previously under-researched areas of the scanner implementation process that merit exploration.
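As a rough illustration of the parallel tokenization idea the review surveys (not the method of any particular reviewed paper), the sketch below splits the source on line boundaries and scans the chunks in separate worker processes. It assumes no token spans a line break, which holds for many languages but fails for multi-line comments or strings.

```python
# Sketch of tokenization parallelized across cores. Splitting on line
# boundaries assumes no token crosses a line break (an assumption that
# multi-line comments or strings would violate).
import re
from concurrent.futures import ProcessPoolExecutor

TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|[+\-*/=<>();]")

def scan_chunk(lines):
    # Each worker tokenizes its contiguous group of lines sequentially.
    return [tok for line in lines for tok in TOKEN_RE.findall(line)]

def parallel_tokenize(source, workers=4):
    lines = source.splitlines()
    size = max(1, len(lines) // workers)
    chunks = [lines[i:i + size] for i in range(0, len(lines), size)]
    with ProcessPoolExecutor(max_workers=workers) as pool:
        # map() preserves chunk order, so the concatenated stream matches
        # what a sequential scan of the whole source would produce.
        return [tok for chunk in pool.map(scan_chunk, chunks) for tok in chunk]

if __name__ == "__main__":
    print(parallel_tokenize("x = 1\ny = x + 2\nz = y * 3\n"))
```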
Syntactic Wordclass Tagging. Kluwer Academic …, 1999
1994
Any linguistic treatment of freely occurring text must provide an answer to what is considered as a token. In artificial languages, the definition of what is considered as a token can be precisely and unambiguously defined. Natural languages, on the other hand, display such a rich variety that there are many ways to decide upon what will be considered as a unit for a computational approach to text. Here we will discuss tokenization as a problem for computational lexicography. Our discussion will cover the aspects of what is usually considered preprocessing of text in order to prepare it for some automated treatment. We present the roles of tokenization, methods of tokenizing, grammars for recognizing acronyms, abbreviations, and regular expressions such as numbers and dates. We present the problems encountered and discuss the effects of seemingly innocent choices.
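In the spirit of the tokenization grammars the paper discusses, a small regex-based tokenizer for natural-language text might look like the Python sketch below. The patterns for dates, numbers, and acronyms are deliberately simplistic assumptions; real systems need far richer rules, and the example shows how a seemingly innocent choice (here, having no pattern for "%") silently drops material.

```python
# Regex tokenizer sketch for natural-language text. The patterns are
# illustrative assumptions, far simpler than production rules.
import re

TOKEN_RE = re.compile(r"""
    \d{1,2}/\d{1,2}/\d{2,4}        # dates such as 12/05/1994
  | \d+(?:\.\d+)?                  # integers and decimals
  | (?:[A-Za-z]\.){2,}             # acronyms such as U.S.A.
  | [A-Za-z]+(?:'[A-Za-z]+)?       # words, keeping simple clitics (didn't)
  | [.,;:!?]                       # sentence punctuation as its own token
""", re.VERBOSE)

text = "The U.S.A. budget grew 3.5% on 12/05/1994, didn't it?"
print(TOKEN_RE.findall(text))
# Note: '%' matches no pattern and is silently discarded -- one of the
# seemingly innocent choices whose effects the paper examines.
```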
International Journal of Scientific Research in Science, Engineering and Technology, 2020
A compiler is a computer program that translates computer code written in one programming language (the source language) into another language (the target language). The name compiler is primarily used for programs that translate source code from a high-level programming language to a lower-level language [7]. The three main processes of compilation are lexical analysis, syntax analysis, and semantic analysis. A compiler has two components, a front-end and a back-end. The front-end portion of a compiler has two main tasks: lexical analysis and syntax analysis. In lexical analysis, the input source code is scanned and split into various tokens [6]. This system uses the front-end portion of the compiler, namely lexical analysis. There are many token elements in the C++ programming language. In this system, the line-break token, the white-space tokens (space and tab), and the operators (+, -, *, /, =, += and so on) are used as token elements for the assignment statements of a C++ source program. The system takes all the assignment statements of a C++ program as input. The extracted assignment statements may be literal or value assignments (e.g. x=3; or pi=3.142;), variable assignments (e.g. x=y; or x=z;), or expression assignments (e.g. a=b+c; or x=y*z; or a=b*(c+d);), and the system produces a symbol table, a step-by-step recognition table built using finite state automata, and a lexeme table.
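As a sketch of this kind of system, the toy finite-state scanner below recognizes the pieces of a C++ assignment statement and emits a simple lexeme table. The state names, character classes, and output layout are illustrative assumptions, not the paper's exact tables.

```python
# Toy finite-state scanner for C++ assignment statements such as
# "a=b*(c+d);". States and table layout are illustrative assumptions.
def classify(ch):
    if ch.isalpha() or ch == "_": return "letter"
    if ch.isdigit() or ch == ".": return "digit"
    if ch in "+-*/=();":          return "symbol"
    if ch in " \t":               return "space"
    raise ValueError(f"illegal character {ch!r}")

def scan(stmt):
    lexemes, state, start = [], "START", 0
    for i, ch in enumerate(stmt + " "):    # trailing space flushes the last lexeme
        cls = classify(ch)
        if state == "IDENT" and cls not in ("letter", "digit"):
            lexemes.append(("identifier", stmt[start:i])); state = "START"
        elif state == "NUMBER" and cls != "digit":
            lexemes.append(("number", stmt[start:i])); state = "START"
        if state == "START":
            if cls == "letter":   state, start = "IDENT", i
            elif cls == "digit":  state, start = "NUMBER", i
            elif cls == "symbol": lexemes.append(("operator", ch))
    return lexemes

print(scan("pi=3.142;"))
# [('identifier', 'pi'), ('operator', '='), ('number', '3.142'), ('operator', ';')]
```

Each transition out of the IDENT or NUMBER state flushes the accumulated characters as one lexeme, which is how a step-by-step tabular FSA recognition proceeds.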
Proceedings of the 14th Conference on Computational Linguistics, 1992