# Glossary

| Term | Explanation |
| --- | --- |
| Escape | An escape character tells the executing program to treat the character that follows it literally rather than as a functional character. Escaping is necessary when a character should be interpreted as ordinary text, ignoring any special meaning it has in the context in which it is used. |
| Garbage Collection (GC) | In software and information technology, garbage collection, abbreviated GC, refers to automatic memory management that minimizes the resources required by a computer program. At runtime, resources that are no longer required are identified automatically so that they can be freed. (Source: https://en.wikipedia.org/wiki/Garbage_collection_(computer_science), last edited March 29, 2022) |
| Lucene | Apache Lucene is a high-performance, open-source program library that is especially suitable for full-text search. Lucene can index any type of document, including texts, numbers, date values, or GPS positions. Lucene is the underlying data storage structure for the Elasticsearch search and analytics engine. |
| NODE/NODES | An Elasticsearch server corresponds to one node. |
| OCR | Optical character recognition (OCR) is the automatic recognition of text within images. Scanned documents are converted into text form using OCR. |
| OSGi (OSGi plug-ins) | The OSGi Alliance (formerly Open Services Gateway initiative) specifies a hardware-independent, dynamic software platform that makes it easier to modularize and administer (service registry) applications and services with a component model (bundle/service). The OSGi platform requires a Java Virtual Machine (JVM), on top of which it provides the OSGi framework. (Source: https://en.wikipedia.org/wiki/OSGi, last edited March 29, 2022) |
| REST | Representational state transfer (abbreviated REST or ReST) refers to a programming paradigm for distributed systems, in particular for web services. REST is an abstraction of the structure and behavior of the World Wide Web. It aims to create an architectural style that better represents the requirements of the modern web. REST differs from other architectural styles primarily in that it requires a uniform interface. REST is primarily focused on machine-to-machine communication. (Source: https://en.wikipedia.org/wiki/Representational_state_transfer, last edited March 29, 2022) |
| Replication | Replication refers to the repeated storage of the same data. In Elasticsearch, you can set how many replicas of each shard are kept. Elasticsearch distributes these replicas across the cluster nodes, which spreads searches over several nodes. If a node fails, the replicas keep the search available without data loss. |
| Shard | The data of an Elasticsearch search index is split into shards. |
| Language analyzer | Elasticsearch is supplied with analyzers for the most common languages. These language analyzers perform four primary functions: they divide the text into single words, convert the tokens into lowercase letters, remove common stop words, and derive the root forms of the tokens (stemming). |
| Search index | Elasticsearch stores all data in a search index. This index is then split into types and documents with properties. |
| SWAP mode/swapping | In IT, swapping within the storage hierarchy refers to moving data from a higher to a lower level or vice versa. Swapping is intended to combine the advantages of the higher levels (higher speed) with those of the lower levels (higher capacity and less expensive memory). (Source: https://de.wikipedia.org/wiki/Swapping, last updated March 29, 2022) |
| Token | When a data set is created or updated, Elasticsearch breaks down the searchable text in the data set into individual terms, referred to as tokens. The tokens are then stored in the search index. A search query is also broken down into tokens, which are then looked up in the search index. A token does not always correspond to a grammatically correct word. Instead, it is a breakdown and simplification of words, which means that searching for "car" also finds documents containing only the word "cars". |
| Tomcat | Apache Tomcat is an open-source web server and web container that implements the specifications for Java servlets and Jakarta Server Pages (JSP), allowing web applications written in Java to be run as servlets or on a JSP basis. |
| YML | YML (`.yml`) is a file extension for files written in the simplified markup language YAML. YAML is used for data serialization and is loosely modeled on XML as well as on the data structures in the languages Perl, Python, and C. |

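
The four analyzer steps described under "Language analyzer", and the "car"/"cars" behavior described under "Token", can be illustrated with a small Python sketch. This is not how Elasticsearch or Lucene implement analysis; the stop-word list and the `naive_stem` helper are hypothetical, heavily simplified stand-ins chosen only to make the idea concrete.

```python
import re

# Hypothetical, deliberately tiny stop-word list (assumption for illustration);
# real language analyzers ship far more complete lists.
STOP_WORDS = {"a", "an", "and", "the", "is", "are", "of", "in", "on", "to"}


def naive_stem(token: str) -> str:
    """Rough stand-in for stemming: strip a few common English suffixes."""
    for suffix in ("ing", "es", "s"):
        # Keep at least three characters so short words stay intact.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def analyze(text: str) -> list[str]:
    """Apply the four steps: split, lowercase, drop stop words, derive root forms."""
    words = re.findall(r"[A-Za-z0-9]+", text)            # 1. divide the text into single words
    lowered = [w.lower() for w in words]                  # 2. convert to lowercase
    kept = [w for w in lowered if w not in STOP_WORDS]    # 3. remove common stop words
    return [naive_stem(w) for w in kept]                  # 4. derive (approximate) root forms


# Both the stored text and the search query go through the same analysis.
doc_tokens = analyze("The cars are parked in the garage")
query_tokens = analyze("car")

print(doc_tokens)                            # ['car', 'parked', 'garage']
print(set(query_tokens) <= set(doc_tokens))  # True
```

Because the document text and the query are reduced to the same token, `car`, the query matches a document that only contains the word "cars". A real analyzer uses proper language-specific stemming rules instead of the suffix stripping assumed here.
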
Last updated: June 17, 2024 at 2:07 PM