Processing
Language Models
Language models are advanced AI systems that can understand, interpret, and generate human language. These models are trained on enormous text datasets and learn patterns, word combinations, sentence structures, and even the subtleties of different languages and language uses. The core of many modern language models is the transformer architecture, which uses self-attention mechanisms to determine which parts of the text are important in a given context.
In language processing, these models use statistical methods to predict the most likely next word or sentence in a text. They can understand context across long pieces of text and thereby generate not only grammatically correct but also contextually coherent and relevant texts.
When using a language model for something like a chatbot or text generator, the model is given certain prompts or initial data, and based on that input, the model generates text that logically follows from the given context. The goal of these models is to produce text that appears as human-like as possible, both in terms of content and style.
Text-to-Image Models
Text-to-image models are AI systems capable of generating visual representations from written textual descriptions, such as photos, illustrations, or other types of imagery. These models use advanced neural networks, more specifically generative adversarial networks (GANs) or variations like diffusion models.
The process begins with a text description entered by a user. The model assesses this text and tries to understand its meaning and context. It then generates images that correspond to the textual description, using what it has learned during training, which involves being trained on massive datasets of text-image pairs.
During training, the model learns associations between textual descriptions and visual features. For example, if the model repeatedly sees the word combination 'a yellow sun above a blue sea' together with images illustrating this scenario, it learns to recognize and reproduce these elements in future image creations.
The result is often surprisingly accurate and detailed images that align with the entered text description. These models are becoming increasingly refined and are able to represent complex scenarios with multiple objects and abstract concepts. They are used in a wide range of applications, including artistic creation, game design, virtual reality, and more.
AI-Public Unlocks Models
It's important to understand that AI-Public unlocks various AI models offered by large technology companies via an API. An API, or Application Programming Interface, is a set of rules and definitions that allow software programs to communicate with each other. It functions like a 'language' understood by programs to exchange information and invoke each other's functions. AI-Public itself does not have language models or text-to-image models.
We are not responsible for the results of the different models. However, we have paid attention to selecting the best and most interesting models for businesses.
Processing Procedure
The following procedure is followed to generate a response:
- The user creates a prompt.
- The front-end web application links this to the active chat and adds a chat message with status "Initializing".
- A function is triggered on AI-Public servers by adding a chat message.
- The chat message status is set to "Processing".
- When selecting a chat with documents, the server first sends a request to the Firestore vector database to select texts from documents.
- The server then sends the request via an API connection to the selected language model.
- If the Streaming setting is on, we save the message after every 10 received chunks and after every 25 chunks after receiving 100 chunks.
- Once the entire response is received, the status is set to "Completed".
- The front-end application is refreshed after each database update.
- If errors are detected, the status is set to "Error" and an error message is displayed.
We do not send any personal data with each API request. However, the user may have included personal data in the prompt or in uploaded documents.