Elasticsearch: Difference between "Term", "Match Phrase", and "Query String"

Elasticsearch

Elasticsearch Problem Overview


New here to Elasticsearch and trying to get a better understanding on the difference between these queries. As far as I can tell, term matches a single term (needs to be lowercase for the match to work?), and both match phrase and query string matches a string of text.

Elasticsearch Solutions


Solution 1 - Elasticsearch

term query matches a single term as it is : the value is not analyzed. So, it doesn't have to be lowercased depending on what you have indexed.

If you provided Bennett at index time and the value is not analyzed, the following query won't return anything :

{
  "query": {
    "term" : { "user" : "bennett" }
  }
}

match_phrase query will analyze the input if analyzers are defined for the queried field and find documents matching the following criteria:

  • all the terms must appear in the field
  • they must have the same order as the input value
  • there must not be any intervening terms, i.e. be consecutive (potentially excluding stop-words but this can be complicated)

For example, if you index the following documents (using standard analyzer for the field foo):

{ "foo":"I just said hello world" }

{ "foo":"Hello world" }

{ "foo":"World Hello" }

{ "foo":"Hello dear world" }

This match_phrase query will only return the first and second documents :

{
  "query": {
    "match_phrase": {
      "foo": "Hello World"
    }
  }
}

query_string query search, by default, on a _all field which contains the text of several text fields at once. On top of that, it's parsed and supports some operators (AND/OR...), wildcards and so on (see related syntax).

As the match_phrase queries, the input is analyzed according to the analyzer set on the queried field.

Unlike the match_phrase, the terms obtained after analysis don't have to be in the same order, unless the user has used quotes around the input.

For example, using the same documents as before, this query will return all the documents :

{
  "query": {
    "query_string": {
      "query": "hello World"
    }
  }
}

But this query will return the same 2 documents as the match_phrase query :

{
  "query": {
    "query_string": {
      "query": "\"Hello World\""
    }
  }
}

There is much more to say about the different options for those queries, please take a look at the related documentation :

Hope this is clear enough and it will help.

Solution 2 - Elasticsearch

I think some one definitely looking for differences between them with respect to PARTIAL SEARCH Here is my analysis with default ‘standard analyzer’ :-

> Suppose ,We have data :-

{ "name" : “Hello”}

Now what if we want to do partial search with ell ???

> Term Query OR Match query

{"term":{"name": "*ell*" }

Will not work , return noting .

{"term":{"name": "*zz* *ell*" }

Will not work , return noting .

Conclusion - Term or Match is not able to do partial search at all

> wildcard Query :-

{"wildcard":{"name": "*ell*" }

Will work give result { "name" : "Hello"}

{"wildcard":{"name": "*zz* *ell*" }

Will not work , return noting .

Conclusion - wildcard is able to do partial search with one token only

> Query_string :-

{"query_string": {"default_field": "name","query": "*ell*"}

Will work give result { "name" : “Hello”}

{"query_string": {"default_field": "name","query": "*zz* *ell*" }

Will work give result { "name" : “Hello”} .

Conclusion - query_string is able to search with two token are given

-> here token are ell and zz

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
Questionblee908View Question on Stackoverflow
Solution 1 - ElasticsearchThomasCView Answer on Stackoverflow
Solution 2 - ElasticsearchVijayView Answer on Stackoverflow