序言

Elasticsearch(名稱太長，後面簡稱ES)作為一個搜尋引擎，目前可謂是如日中天，幾乎和solr齊駕并驅。關于他能做什麼，跟雲計算有什麼關系，在此不再描述。但是ES的官方文檔，特别是關于java的用戶端文檔，真是少的可憐，甚至連個完整的增删改的示例都沒有。在此，我就獻醜了。

在開始講解之前，還是先做個鋪墊，為了能夠有一個可以索引的模型，我們自定義了一個模型，暫時起個名稱叫LogModel吧，這個模型有各種資料類型，int,long,String,list，但千萬不要認為這是跟記錄日志有關的一個模型。作為索引的一個最簡單模型。代碼如下：

import java.util.ArrayList;  
import java.util.List;  
import java.util.Random;  
import java.util.UUID;  
/** 
 * 瞎編的一個模型，跟日志基本沒有關系 
 * @author donlian 
 */  
public class LogModel {  
    //主ID  
    private long id;  
    //次ID  
    private int subId;  
    /** 
     * 系統名稱 
     */  
    private String systemName;  
    private String host;  

    //日志描述  
    private String desc;  
    private List<Integer> catIds;  
    public LogModel(){  
        Random random = new Random();  
        this.id = Math.abs(random.nextLong());  
        int subId = Math.abs(random.nextInt());  
        this.subId = subId;  
        List<Integer> list = new ArrayList<Integer>();  
        for(int i=;i<;i++){  
            list.add(Math.abs(random.nextInt()));  
        }  
        this.catIds = list;  
        this.systemName = subId% == ?"oa":"cms";  
        this.host = subId% == ?"10.0.0.1":"10.2.0.1";  
        this.desc = "中文" + UUID.randomUUID().toString();  
    }  
    public LogModel(long id,int subId,String sysName,String host,String desc,List<Integer> catIds){  
        this.id = id;  
        this.subId = subId;  
        this.systemName = sysName;  
        this.host = host;  
        this.desc = desc;  
        this.catIds = catIds;  
    }  
...//省去get,set方法  
}

同時，因為ES在索引的時候，一般都用json格式，是以，使用jackson定義了一個将對象轉化成json的工具類，也很簡單，代碼：

public class ESUtils {  
    private static ObjectMapper objectMapper = new ObjectMapper();  
    public static String toJson(Object o){  
        try {  
            return objectMapper.writeValueAsString(o);  
        } catch (JsonProcessingException e) {  
            e.printStackTrace();  
        }  
        return "";  
    }  
}

在開始進行操作ES伺服器之前，我們必須得獲得ES的API，簡單介紹一下ES操作伺服器的兩種方式，一種是使用Node方式，即本機也啟動一個ES，然後跟伺服器的ES進行通信，這個node甚至還能存儲（奇怪，一般需要這樣的方式嗎？），另一種，就是下面我介紹的這一種，通過一個對象使用http協定跟伺服器進行互動。

獲得一個ES用戶端API的代碼如下：

Settings settings = ImmutableSettings.settingsBuilder()  
                //指定叢集名稱  
                .put("cluster.name", "elasticsearch")  
                //探測叢集中機器狀态  
                .put("client.transport.sniff", true).build();  
        /* 
         * 建立用戶端，所有的操作都由用戶端開始，這個就好像是JDBC的Connection對象 
         * 用完記得要關閉 
         * 注意client裡面包含了連接配接池，對于client而言用完之後需要關閉，但是針對連接配接而言不需要關閉
         */  
        Client client = new TransportClient(settings)  
        .addTransportAddress(new InetSocketTransportAddress("192.168.1.106", ));

Client對象，可以了解為資料庫的Connection對象。好了，準備工作完成，下面就開始增删改查。

Client更類似與連接配接池

Index(增加）

ES裡面的增加對象不叫什麼add,save等，叫index。但無論叫什麼名稱，反正就是向ES伺服器裡面加資料。上面說過一個對象轉json的工具類，其實ES的API中，是自帶建構json的工具類的。

import org.elasticsearch.action.index.IndexResponse;  
import org.elasticsearch.client.Client;  
import org.elasticsearch.client.transport.TransportClient;  
import org.elasticsearch.common.settings.ImmutableSettings;  
import org.elasticsearch.common.settings.Settings;  
import org.elasticsearch.common.transport.InetSocketTransportAddress;  

import com.donlianli.es.ESUtils;  
import com.donlianli.es.model.LogModel;  
/** 
 * 向ES添加索引對象 
 * @author donlian 
 */  
public class IndexTest {  
    public static void main(String[] argv){  
        Settings settings = ImmutableSettings.settingsBuilder()  
                //指定叢集名稱  
                .put("cluster.name", "elasticsearch")  
                //探測叢集中機器狀态  
                .put("client.transport.sniff", true).build();  
        /* 
         * 建立用戶端，所有的操作都由用戶端開始，這個就好像是JDBC的Connection對象 
         * 用完記得要關閉 
         */  
        Client client = new TransportClient(settings)  
        .addTransportAddress(new InetSocketTransportAddress("192.168.1.106", ));  
        String json = ESUtils.toJson(new LogModel());  
        //在這裡建立我們要索引的對象  
        IndexResponse response = client.prepareIndex("twitter", "tweet")  
                //必須為對象單獨指定ID  
                .setId("1")  
                .setSource(json)  
                .execute()  
                .actionGet();  
        //多次index這個版本号會變  
        System.out.println("response.version():"+response.version());  
        client.close();  
    }  
}

運作這個代碼，就向ES插入了一條資料，你運作兩遍，還是一條。ES根據你設定的ID來設定對象，如果沒有則插入，有則更新。每更新一次，對應的version加1.

好了，在次，使用以下指令，應該能夠查詢到一條記錄了。

curl -XGET 'http://localhost:9200/twitter/tweet/1'

delete(删除）

有了增加的例子，删除的例子也就好寫了。增加是prepareIndex，删除是prepareDelete,查詢就是PrepareGet。

代碼如下：

import org.elasticsearch.action.delete.DeleteResponse;  
import org.elasticsearch.client.Client;  
import org.elasticsearch.client.transport.TransportClient;  
import org.elasticsearch.common.settings.ImmutableSettings;  
import org.elasticsearch.common.settings.Settings;  
import org.elasticsearch.common.transport.InetSocketTransportAddress;  

import com.donlianli.es.ESUtils;  

public class DeleteTest {  
    public static void main(String[] argv){  
        Settings settings = ImmutableSettings.settingsBuilder()  
                //指定叢集名稱  
                .put("cluster.name", "elasticsearch")  
                //探測叢集中機器狀态  
                .put("client.transport.sniff", true).build();  
        /* 
         * 建立用戶端，所有的操作都由用戶端開始，這個就好像是JDBC的Connection對象 
         * 用完記得要關閉 
         */  
        Client client = new TransportClient(settings)  
        .addTransportAddress(new InetSocketTransportAddress("192.168.1.106", ));  
        //在這裡建立我們要索引的對象  
        DeleteResponse response = client.prepareDelete("twitter", "tweet", "1")  
                .execute().actionGet();  
        System.out.println(response.getId());  
        System.out.println(ESUtils.toJson(response.getHeaders()));  
    }  
}

GET(查詢）

import org.elasticsearch.action.get.GetResponse;  
import org.elasticsearch.client.Client;  
import org.elasticsearch.client.transport.TransportClient;  
import org.elasticsearch.common.settings.ImmutableSettings;  
import org.elasticsearch.common.settings.Settings;  
import org.elasticsearch.common.transport.InetSocketTransportAddress;  

public class GetTest {  
    public static void main(String[] argv){  
        Settings settings = ImmutableSettings.settingsBuilder()  
                //指定叢集名稱  
                .put("cluster.name", "elasticsearch")  
                //探測叢集中機器狀态  
                .put("client.transport.sniff", true).build();  
        /* 
         * 建立用戶端，所有的操作都由用戶端開始，這個就好像是JDBC的Connection對象 
         * 用完記得要關閉 
         */  
        Client client = new TransportClient(settings)  
        .addTransportAddress(new InetSocketTransportAddress("192.168.1.106", ));  
        //在這裡建立我們要索引的對象  
        GetResponse response = client.prepareGet("twitter", "tweet", "1")  
                .execute().actionGet();  
        System.out.println("response.getId():"+response.getId());  
        System.out.println("response.getSourceAsString():"+response.getSourceAsString());  
    }  
}

好了，增删改查的代碼寫完。至于搜尋，那是一個比較深入的話題，我也在慢慢探索。我時間我會繼續寫下去。

注

對于ES的Java Client我覺得更多的是類似于JDBC連接配接池的概念。你不能在程式裡不停的new client，所有的連接配接都可以使用同一個client來操作，類似于我們在JDBC裡每一次的查詢都會使用同一個連接配接池。極端情況下不斷的執行個體化client會導緻伺服器達到最大連接配接數，進而是應用報錯。根據經驗，預設情況下client保留的連接配接數是15。

如有不同見解，歡迎留言讨論！

原文出自http://donlianli.iteye.com/blog/1902238

本文已收錄于以下專欄：
Elasticsearch實踐

【Elasticsearch】Java Client入門序言

happydream_C
2017-09-11 17:221樓回複
部落客您好，有個es問題想請教下，在javaweb程式伺服器啟動後，首次從es查詢資料時擷取TransportClient都會比較久，後面查詢重新擷取TransportClient會相對快點，不知道具體是什麼原因造成的，部落客有解決過相同的問題麻煩分享下問題原因及解決方案，非常感謝。

擷取TransportClient代碼如下：

public static Client getClient() throws UnknownHostException {

Client client = null;

try {

if (StringUtils.isEmpty(ip)) {

ip = def_ip;

}

if (port == 0) {

port = def_port;

}

client = TransportClient.builder().build().addTransportAddress(new InetSocketTransportAddress(InetAddress.getByName(ip), port));

}

catch (Exception e) {

e.printStackTrace();

}

return client;

}

【Elasticsearch】Java Client入門序言

序言

Index(增加）

delete(删除）

GET(查詢）

注

happydream_C

繼續閱讀

資料采集學習劄記

搭建elk日志系統（持續更新中）

ElasticSearch叢集健康值叢集健康值GET _cluster/health參考文獻

ElasticSearch--優化寫入速度的方法--修改配置

kibana 常見錯誤處理

極易上手搭建自己日志采集伺服器分析日志（winlogbeat+Elasticsearch+Kibana)

EFK 監控postgresql日志

logstash使用grok正則解析日志和kibana遇到的問題

elk筆記4--grok正則解析1 grok 切分方法2 grok 切分案例3 說明

ELK大規模日志實時處理系統理論分析一

CentOS6.8安裝ELK日志平台

elk錯誤：Too Many Requests

SpringCloud-鍊路追蹤sleuth

ELK日志分析系統原理與部署一、ELK日志分析系統簡介二、三款軟體各自概念三、ELK日志分析系統部署

Elasticsearch 使用copy_to組合字段進行查詢copy_to 介紹

ELK詳細分析nginx日志1. 實驗規劃2. ELK安裝3. logstash伺服器的配置4. kibana檢視配置5. 建立儀表闆顯示pv和uv

【Elasticsearch】Java Client入門 序言

序言

Index(增加）

delete(删除）

GET(查詢）

注

happydream_C

繼續閱讀

【Elasticsearch】Java Client入門序言