
Understand in Seconds: An Illustrated Guide to the AI Training and Working Process (Chinese Flowchart)

Author: A New Impetus for AI

To help everyone understand the AI training process as a whole, without too many individual concepts getting in the way of the big picture, we drew this flowchart.


Diagram of the AI training process

  1. Data collection: gather data for training; the output is the raw data (an unprocessed dataset)
  2. Data preprocessing: clean and format the data; the output is the preprocessed data (data suitable for model training)
  3. Model initialization: set the model's initial parameters; the output is the initialized model (a model with initial parameters)
  4. Model training: train the model on the data; the output is the trained model (the model after one round of training)
  5. Model evaluation: measure the model's performance; the output is the evaluation results (the model's performance metrics)
  6. Model optimization: tune the model based on the evaluation results; the output is the optimized model
  7. Model deployment: put the model into the production environment; the output is the final model (the model that meets the requirements). A minimal end-to-end sketch of these seven steps follows.
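
To make the seven steps concrete, here is a minimal sketch in Python using scikit-learn. The dataset (Iris), the classifier (logistic regression), and the file name `model.joblib` are illustrative assumptions, not part of the original article:

```python
# Minimal sketch of the seven training steps (illustrative choices throughout).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
import joblib

# 1. Data collection: load a raw, unprocessed dataset.
X_raw, y = load_iris(return_X_y=True)

# 2. Data preprocessing: clean/format the data (here: standardize features).
X = StandardScaler().fit_transform(X_raw)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# 3. Model initialization: create a model with initial parameters.
model = LogisticRegression(max_iter=200)

# 4. Model training: fit the model to the training data.
model.fit(X_train, y_train)

# 5. Model evaluation: measure performance on held-out data.
print("accuracy:", accuracy_score(y_test, model.predict(X_test)))

# 6. Model optimization: tune hyperparameters based on the evaluation results.
search = GridSearchCV(LogisticRegression(max_iter=200), {"C": [0.1, 1.0, 10.0]}, cv=5)
search.fit(X_train, y_train)

# 7. Model deployment: save the final model for use in production.
joblib.dump(search.best_estimator_, "model.joblib")
```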

The figure below is a simplified diagram of the ChatGPT workflow:


ChatGPT workflow diagram

1. User input: The original input provided by the user.

2. Input processing: process the user's input and prepare it to be fed into the model.

  1. Tokenizer: breaks the input text into small pieces (tokens) that the model can understand.
  2. Tokenization: the process of converting the input text into tokens (see the sketch after this list).
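
As an illustration (not from the original article), this is what tokenization looks like with OpenAI's tiktoken library; the encoding name `cl100k_base` is an assumption:

```python
import tiktoken  # OpenAI's BPE tokenizer library

enc = tiktoken.get_encoding("cl100k_base")  # encoding used by several OpenAI models
tokens = enc.encode("Hello, world!")
print(tokens)  # a list of integer token IDs; the exact IDs depend on the encoding
```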

3. ChatGPT model: receives the processed input and generates a reply.

  1. Transformer model: the core of ChatGPT, responsible for understanding the input and generating the reply.
  2. Self-attention mechanism: a key part of the Transformer, used to capture the relationships between the different parts of the input.
  3. Multi-head attention: part of the self-attention mechanism that lets the model attend to multiple parts of the input at the same time.
  4. Scaled dot-product attention: the component inside multi-head attention that computes the relationships between the input parts (a sketch follows this list).
  5. Reply generation: produces a response based on the understood input and the patterns learned during training.
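
For reference, scaled dot-product attention computes softmax(QK^T / sqrt(d_k))V, as described in the original Transformer paper. Below is a minimal NumPy sketch (an illustration, not ChatGPT's actual implementation); in multi-head attention this computation runs once per head:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V for 2-D arrays of shape (seq_len, d_k)."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # pairwise relationships between positions
    scores -= scores.max(axis=-1, keepdims=True)  # subtract row max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over each row
    return weights @ V                            # weighted sum of the value vectors

# Toy usage: a sequence of 3 positions with dimension 4.
rng = np.random.default_rng(0)
Q = K = V = rng.normal(size=(3, 4))
print(scaled_dot_product_attention(Q, K, V))
```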

4. Output processing: Process the output generated by the model and prepare it for display to users.

  1. Detokenizer: reassembles the tokens generated by the model into human-readable text.
  2. Detokenization: the process of converting tokens back into text (round-trip sketch below).
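
Continuing the earlier tiktoken illustration, detokenization is simply the decode step; encoding and then decoding recovers the original text:

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
tokens = enc.encode("Hello, world!")  # text -> token IDs
print(enc.decode(tokens))             # token IDs -> "Hello, world!"
```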

5. User sees the reply: the final reply is displayed to the user.


Token example
