The synergy of knowledge graphs with large language models

Author: The Frontier of the AI Era

Extracting valuable insights from unstructured text is a key application in the financial industry. This task, however, often goes beyond simple data extraction and requires advanced reasoning capabilities.

A typical example is determining the maturity date of a credit agreement, which usually involves deciphering a complex instruction such as "the maturity date shall be the last business day before the third anniversary of the effective date". Inference at this level is challenging for large language models (LLMs), because it requires incorporating external knowledge, such as a holiday calendar, to interpret and apply the instruction accurately. An integrated knowledge graph is a promising solution, with several key advantages.
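To make the deterministic half of that task concrete, here is a minimal Python sketch of the date logic, assuming a hypothetical HOLIDAYS set that stands in for a real jurisdiction-specific holiday calendar; this calendar is exactly the external knowledge an LLM cannot reliably supply on its own:

```python
from datetime import date, timedelta

# Hypothetical holiday set; a real system would load a
# jurisdiction-specific calendar from an external source.
HOLIDAYS = {date(2027, 1, 1), date(2027, 12, 24)}

def is_business_day(d: date) -> bool:
    # Weekdays 0-4 are Monday through Friday; also skip holidays.
    return d.weekday() < 5 and d not in HOLIDAYS

def maturity_date(effective: date) -> date:
    # "The last business day before the third anniversary of the
    # effective date" (leap-day effective dates are ignored here).
    anniversary = effective.replace(year=effective.year + 3)
    d = anniversary - timedelta(days=1)
    while not is_business_day(d):
        d -= timedelta(days=1)
    return d

print(maturity_date(date(2024, 1, 15)))  # 2027-01-14, a Thursday
```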

The advent of the Transformer architecture revolutionized text vectorization, achieving unprecedented precision. The resulting embeddings capture deeper semantic meaning than earlier approaches could, which is a large part of why large language models (LLMs) are so effective at generating text.

LLMs also demonstrate reasoning ability, albeit with limits: the reliability of their reasoning tends to decline rapidly as its depth increases. Combining knowledge graphs with these vector embeddings, however, can significantly improve reasoning. This synergy leverages the inherent semantic richness of embeddings to support far deeper inference, marking a major advance in AI.

In the financial sector, LLMs are primarily used through retrieval-augmented generation (RAG), a method that supplies LLMs with knowledge they were never trained on. The process involves encoding the text data, indexing it for efficient retrieval, encoding the query, and using similarity search to fetch the most relevant passages. The retrieved passages are then passed to the LLM together with the query as the grounding for its response.
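A minimal sketch of that pipeline, assuming the sentence-transformers library for encoding and a hypothetical llm_generate call for the final generation step:

```python
import numpy as np
from sentence_transformers import SentenceTransformer

passages = [
    "The maturity date shall be the last business day before the "
    "third anniversary of the effective date.",
    "Interest accrues at the reference rate plus an applicable margin.",
]

model = SentenceTransformer("all-MiniLM-L6-v2")

# 1. Encode the corpus and index it (a production system would use
#    FAISS or a vector database instead of a plain matrix).
index = model.encode(passages, normalize_embeddings=True)

# 2. Encode the query and retrieve the most similar passages.
query = "When does the credit agreement mature?"
q_vec = model.encode([query], normalize_embeddings=True)[0]
scores = index @ q_vec  # cosine similarity, since vectors are unit-norm
top = [passages[i] for i in np.argsort(-scores)[:2]]

# 3. Ground the LLM's answer in the retrieved passages.
prompt = "Context:\n" + "\n".join(top) + f"\n\nQuestion: {query}"
# answer = llm_generate(prompt)  # hypothetical call to the LLM in use
```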

This approach greatly expands the knowledge an LLM can draw on, making it invaluable for financial analysis and decision-making. But while RAG marks a significant step forward, it has limitations.

A key drawback is that passage vectors may not fully capture the semantic intent of the query, causing important context to be overlooked. This happens because the embeddings may miss some of the inferential connections needed to understand the full scope of the query.

In addition, condensing a complex passage into a single vector can lose nuance, obscuring key details that are distributed across its sentences.

The matching process also handles each passage in isolation, with no mechanism for jointly analyzing and connecting separate facts. This hinders the model's ability to aggregate information from multiple sources, which is often necessary for comprehensive, accurate answers that draw on different contexts.

There have been many efforts to improve the RAG framework, from optimizing chunk sizes to parent-chunk retrievers, hypothetical question embeddings, and query rewriting (sketched below). While these strategies bring improvements, they do not change the outcome fundamentally. Another route is to bypass RAG altogether by expanding the context window, as Google Gemini did in jumping to a 1-million-token capacity. But this brings its own challenges: attention is spread unevenly across the expanded context, and processing that much context can increase costs by a factor of thousands.
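As an illustration of one of these strategies, here is a sketch of query rewriting; the llm parameter is a hypothetical text-completion callable rather than any specific API:

```python
def rewrite_query(query: str, llm) -> str:
    # Query rewriting: restate the question in the vocabulary the
    # corpus is likely to use, then retrieve with the rewritten form.
    prompt = (
        "Rewrite this question using the wording typical of financial "
        f"filings, without changing its meaning:\n{query}"
    )
    return llm(prompt)

# rewrite_query("When do we get our money back?", llm)
# might yield: "What is the maturity date of the facility?"
```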

Combining knowledge graphs with dense vectors is the most promising solution. While embeddings effectively compress text of any length into fixed-dimensional vectors, enabling the identification of semantically similar phrases, they sometimes fail to distinguish key nuances. For example, "cash and due from banks" and "cash and cash equivalents" produce nearly identical vectors, even though the apparent similarity hides a substantial difference: the latter includes interest-bearing instruments such as "asset-backed securities" or "money market funds," while "cash and due from banks" refers to non-interest-bearing deposits.
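The near-identity is easy to observe directly. A small check, assuming the same sentence-transformers model as in the earlier sketch (exact scores vary by model):

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
a, b = "cash and due from banks", "cash and cash equivalents"
va, vb = model.encode([a, b], normalize_embeddings=True)
print(float(va @ vb))  # a high similarity despite the accounting difference
```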

A knowledge graph captures the complex interrelationships between concepts. This fosters deeper contextual insight, surfacing distinguishing characteristics through the connections between concepts. For example, a US GAAP knowledge graph explicitly defines the sum of "cash equivalents", "interest-bearing deposits in banks", and "cash and due from banks" as "cash and cash equivalents".
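A definition like that reduces naturally to graph triples. A sketch, using simplified stand-ins for the actual US GAAP taxonomy element names:

```python
# Illustrative triples from a US-GAAP-style knowledge graph; the
# element names are simplified stand-ins, not official identifiers.
triples = [
    ("CashAndCashEquivalents", "sumOf", "CashEquivalents"),
    ("CashAndCashEquivalents", "sumOf", "InterestBearingDepositsInBanks"),
    ("CashAndCashEquivalents", "sumOf", "CashAndDueFromBanks"),
    ("CashAndDueFromBanks", "bearsInterest", "false"),
    ("InterestBearingDepositsInBanks", "bearsInterest", "true"),
]

def components(concept: str, graph) -> list[str]:
    # Concepts that roll up into `concept` via the sumOf relation.
    return [o for s, p, o in graph if s == concept and p == "sumOf"]

print(components("CashAndCashEquivalents", triples))
```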

By integrating these detailed contextual cues and relationships, knowledge graphs significantly improve the reasoning capabilities of LLMs. They enable more precise multi-level inference within a single graph and facilitate joint inference across multiple graphs.
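As a minimal sketch of multi-level inference, reusing the triple format from above: following sumOf edges transitively turns a question about an aggregate line item into the leaf-level concepts that compose it.

```python
def rollup(concept: str, graph) -> list[str]:
    # Follow sumOf edges transitively down to leaf-level concepts.
    children = [o for s, p, o in graph if s == concept and p == "sumOf"]
    if not children:
        return [concept]
    leaves: list[str] = []
    for child in children:
        leaves.extend(rollup(child, graph))
    return leaves

# A two-level toy graph: a total, an intermediate subtotal, and leaves.
graph = [
    ("CashAndCashEquivalents", "sumOf", "CashAndDueFromBanks"),
    ("CashAndCashEquivalents", "sumOf", "CashEquivalents"),
    ("CashEquivalents", "sumOf", "MoneyMarketFunds"),
]
print(rollup("CashAndCashEquivalents", graph))  # leaf concepts only
```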
