Finite automata and formal language recognition

Authors

DOI:

https://doi.org/10.70577/asce.v5i2.775

Keywords:

Automation, Determinism, Programming language, Lexicography, Character recognition

Abstract

Finite automata are considered fundamental pillars for structured information processing, as they constitute the primary tool for recognizing formal languages, specifically those classified as regular. The objective of this research is to analyze the theoretical foundations of finite automata and explain their function in the recognition of formal languages. A literature review was conducted with a critical, descriptive, and analytical approach; the PRISMA method was applied; previously published scientific articles were compiled from academic databases; and filters were applied as inclusion and exclusion criteria, taking into account the research questions and objectives. A comparison of finite automata and their relationship to formal languages ​​was made. It was found that every non-deterministic finite automaton has an equivalent deterministic finite automaton that recognizes the same language, guaranteeing that non-determinism does not increase expressive power. The robustness of formal languages ​​is based on their closure properties, which allow finite automata to move beyond the theoretical realm and be integrated into critical engineering applications. It was concluded that finite automata are consolidated as the most elementary level of the Chomsky hierarchy and regular grammars. The algebraic closure properties allow these models to be integrated into critical applications such as formal verification. In future research, it is recommended to prioritize the use of minimized deterministic finite automata in the development of lexical analyzers for compilers and pattern recognition engines.

Downloads

Download data is not yet available.

References

Allamanis, M., Barr, E. T., Devanbu, P., & Sutton, C. (2019). A Survey of Machine Learning for Big Code and Naturalness. ACM Computing Surveys, 51(4), 1-37. https://doi.org/10.1145/3212695

Broy, M., Brucker, A. D., Fantechi, A., Gleirscher, M., Havelund, K., Kuppe, M. A., Mendes, A., Platzer, A., Ringert, J. O., & Sullivan, A. (2025). Does Every Computer Scientist Need to Know Formal Methods? Formal Aspects of Computing, 37(1), 1-17. https://doi.org/10.1145/3670795

Cummins, C., Seeker, V., Grubisic, D., Elhoushi, M., Liang, Y., Roziere, B., Gehring, J., Gloeckle, F., Hazelwood, K., Synnaeve, G., & Leather, H. (2023). Large Language Models for Compiler Optimization (Versión 1). arXiv. https://doi.org/10.48550/ARXIV.2309.07062

Dhayalkar, S. R. (2025). Neural Networks as Universal Finite-State Machines: A Constructive Deterministic Finite Automaton Theory (Versión 2). arXiv. https://doi.org/10.48550/ARXIV.2505.11694

Dora, J. R., Hluchý, L., & Staňo, M. (2025). In-Memory Shellcode Runner Detection in Internet of Things (IoT) Networks: A Lightweight Behavioral and Semantic Analysis Framework. Sensors, 25(17), 5425. https://doi.org/10.3390/s25175425

Figueiras, P., Ioannou, G., Lambri, C., Reji, S., Mossali, E., Neviani, M. I., Koussouris, S., Bountouni, N., In Cugurra, M. D. B., Hellbach, R., Bibikas, D., O’Brien, P., Agostinho, C., & Jardim-Gonçalves, R. (2026). Roadmaps and Agendas for Research and Innovation in Artificial Intelligence, Data, and Robotics. En E. Curry, P. Piatkiewicz, F. Heintz, H. Vornhagen, A. N. Belbachir, E. Girardi, M. Schoenauer, & J. Röning (Eds.), Artificial Intelligence, Data and Robotics (pp. 547-579). Springer Nature Switzerland. https://doi.org/10.1007/978-3-032-10561-5_19

Gambhire, S., Panhalkar, A. R., Patil, D. H., Kedar, S., Kumar, M., & Panchal, B. Y. (2026). Leveraging formal languages and automata theory in natural language processing (NLP). Journal of Discrete Mathematical Sciences & Cryptography, 29(2-B), 995-1004. https://doi.org/10.47974/JDMSC-2552

Hou, Z. (2021). Automata Theory and Formal Languages. En Z. Hou, Fundamentals of Logic and Computation (pp. 119-162). Springer International Publishing. https://doi.org/10.1007/978-3-030-87882-5_4

Kamble, U., Nalawade, S., Mahadik, T., Salunke, Y., Dedgaonkar, S., & Shelke, P. (2024). Survey of Application of Automata Theory in Natural Language Processing. 2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS), 1-4. https://doi.org/10.1109/ICETSIS61505.2024.10459556

Karakonstantis, I., & Xylomenos, G. (2026). A Review of Two-Dimensional Cellular Automata Models for Wildfire Simulation: Methods, Capabilities, and Limitations. Fire, 9(3), 108. https://doi.org/10.3390/fire9030108

Kumar, A., Awasthy, N., Jose, A. S., Ramesh, G., Gupta, M., Srinija, K., & Najm, R. (2025). The Intersection of Algorithms, Automata Theory, and Formal Languages in Computational Theory. 2025 International Conference on Intelligent Control, Computing and Communications (IC3), 691-697. https://doi.org/10.1109/IC363308.2025.10956289

Morazán, M. T., & Minić, T. (2024). Finite-State Automaton To/From Regular Expression Visualization. https://doi.org/10.48550/ARXIV.2407.08088

Nagy, B. (2022). From Finite Automata to Fractal Automata – The Power of Recursion. En J. Durand-Lose & G. Vaszil (Eds.), Machines, Computations, and Universality (Vol. 13419, pp. 109-125). Springer International Publishing. https://doi.org/10.1007/978-3-031-13502-6_8

Negrini, L., Arceri, V., Cortesi, A., & Ferrara, P. (2024). TARSIS: An effective automata‐based abstract domain for string analysis. Journal of Software: Evolution and Process, 36(8), e2647. https://doi.org/10.1002/smr.2647

Nicol, J., & Frohme, M. (2025). Deconstructing Subset Construction—Reducing While Determinizing (Versión 2). arXiv. https://doi.org/10.48550/ARXIV.2505.10319

Postiglione, A. (2024). Finite State Automata on Multi-Word Units for Efficient Text-Mining. Mathematics, 12(4), 506. https://doi.org/10.3390/math12040506

Seshia, S. A., Sharygina, N., & Tripakis, S. (2018). Modeling for Verification. En E. M. Clarke, T. A. Henzinger, H. Veith, & R. Bloem (Eds.), Handbook of Model Checking (pp. 75-105). Springer International Publishing. https://doi.org/10.1007/978-3-319-10575-8_3

Urbat, H., & Schröder, L. (2020). Automata Learning: An Algebraic Approach. Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science, 900-914. https://doi.org/10.1145/3373718.3394775

Yadav, A., Patel, A., & Shah, M. (2021). A comprehensive review on resolving ambiguities in natural language processing. AI Open, 2, 85-92. https://doi.org/10.1016/j.aiopen.2021.05.001

Yokomori, T., & Okubo, F. (2021). Theory of reaction automata: A survey. Journal of Membrane Computing, 3(1), 63-85. https://doi.org/10.1007/s41965-021-00070-6

Yue, J., Yan, Y., & Chen, Z. (2019). Language acceptability of finite automata based on theory of semi‐tensor product of matrices. Asian Journal of Control, 21(6), 2634-2643. https://doi.org/10.1002/asjc.2190

Published

2026-04-22

How to Cite

Sarabia Martínez , A. Y. (2026). Finite automata and formal language recognition. ANNALS SCIENTIFIC EVOLUTION, 5(2), 453–472. https://doi.org/10.70577/asce.v5i2.775

Similar Articles

1 2 3 4 5 6 > >> 

You may also start an advanced similarity search for this article.