Cloud NiFi Processors
  • Calculated Systems NiFi Processors
  • Amazon Web Services
    • Comprehend API
      • DetectDominantLanguage Processor
      • DetectKeyPhrases Processor
      • DetectEntities Processor
        • Entity Types
      • DetectSentiment Processor
      • DetectSyntax Processor
    • Textract API
      • DetectDocumentText Processor
      • AnalyzeDocument Processor
      • Block Types
  • Google Cloud Platform
    • Natural Language API
      • AnalyzeSyntax Processor
      • AnalyzeEntities Processor
      • AnalyzeSentiment Processor
      • AnalyzeEntitiesWithSentiment Processor
      • ClassifyText Processor
      • AnnotateText Processor
      • Entity Types
      • The Metadata Field
Powered by GitBook
On this page
  • Properties
  • Data Output

Was this helpful?

  1. Google Cloud Platform
  2. Natural Language API

AnalyzeSyntax Processor

Part of the GCP Natural Language processor family

PreviousNatural Language APINextAnalyzeEntities Processor

Last updated 5 years ago

Was this helpful?

This processor analyzes an input string and breaks it down into sentences and tokens (which, in most cases, are words). Each token has a plethora of information about it, including tense, quantity, person, and many more. An extensive list is provided in the Data Output section.

Properties

All of our Natural Language processors also include these .

This processor does not have any unique properties outside of the common ones.

Data Output

Field Name

Data Type

Description

tokens

array of Token

The list of the tokens parsed by the API

sentences

array of Sentence

The list of the sentences parsed by the API

language

string

The language code of the language the input string is in

Token

Field Name

Data Type

Description

text

string

The raw text of this token

lemma

string

The root/bare version of this word. Read more about lemma .

beginOffset

int

The number of characters from the beginning of the input string to the beginning of this token

partOfSpeech

PartOfSpeech

A comprehensive list of all of the syntactical information pertaining to this token, such as gender, quantity, person, etcetera.

dependencyEdge

DependencyEdge

Represents dependency tree information for this token. Read more .

Sentence

Field Name

Data Type

Description

text

string

The raw text of this sentence

beginOffset

int

The number of characters from the beginning of the input string to the beginning of this sentence

DependencyEdge

Field Name

Data Type

Description

headTokenIndex

int

Represents the head of the token in the dependency tree

label

string ()

The parse label for the token

PartOfSpeech

Field Name

Data Type

Description

aspect

string ()

The characteristic of a verb that expresses time flow during an event

case

string ()

The grammatical function performed by a noun or pronoun in a phrase, clause, or sentence

form

string ()

Depending on the language, Form can be categorizing different forms of verbs, adjectives, adverbs, etcetera

gender

string ()

Gender classes of nouns reflected in the behavior of associated words

mood

string ()

The grammatical feature of verbs, used for showing modality and attitude

number

string ()

Count distinctions

person

string ()

The distinction between the speaker, second person, third person, etcetera

proper

string ()

Distinguishes if the token is part of a proper name

reciprocity

string ()

Reciprocal features of a pronoun

tag

string ()

A shorthand-tag for what part of speech the token is

tense

string ()

Time reference

voice

string ()

The relationship between the action that a verb expresses and the participants identified by its arguments

// This output has been heavily edited for brevity

{
   "output":{
      "sentences":[
         {
            "text":"James Adams is a fantastic developer who has been on the scene for over ten years.",
            "beginOffset":-1
         },
         {
            "text":"His excellent attitude and steady work ethic contribute to his character.",
            "beginOffset":-1
         },
         {
            "text":"May he stay with our company for many years to come.",
            "beginOffset":-1
         }
      ],
      "tokens":[
         {
            "dependencyEdge":{
               "label":"NN",
               "headTokenIndex":1
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER",
               "tag":"NOUN",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"James",
            "text":"James",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"ATTR",
               "headTokenIndex":2
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"NOUN",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"developer",
            "text":"developer",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"NSUBJ",
               "headTokenIndex":8
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"THIRD",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"PRON",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"who",
            "text":"who",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"AUX",
               "headTokenIndex":8
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"INDICATIVE",
               "person":"THIRD",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"VERB",
               "tense":"PRESENT",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"have",
            "text":"has",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"RCMOD",
               "headTokenIndex":5
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"VERB",
               "tense":"PAST",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"be",
            "text":"been",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"PREP",
               "headTokenIndex":8
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"ADP",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"on",
            "text":"on",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"DET",
               "headTokenIndex":11
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"DET",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"the",
            "text":"the",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"POBJ",
               "headTokenIndex":9
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"NOUN",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"scene",
            "text":"scene",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"PREP",
               "headTokenIndex":8
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"ADP",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"for",
            "text":"for",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"ADVMOD",
               "headTokenIndex":14
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"ADP",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"over",
            "text":"over",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"NUM",
               "headTokenIndex":15
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"NUM",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"ten",
            "text":"ten",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"POBJ",
               "headTokenIndex":12
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"PLURAL",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"NOUN",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"year",
            "text":"years",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"P",
               "headTokenIndex":2
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"PUNCT",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":".",
            "text":".",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"ROOT",
               "headTokenIndex":24
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"VERB",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"contribute",
            "text":"contribute",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"PREP",
               "headTokenIndex":24
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"ADP",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"to",
            "text":"to",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"POSS",
               "headTokenIndex":27
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"MASCULINE",
               "mood":"MOOD_UNKNOWN",
               "person":"THIRD",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"PRON",
               "tense":"TENSE_UNKNOWN",
               "case":"GENITIVE",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"his",
            "text":"his",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"POBJ",
               "headTokenIndex":25
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"NOUN",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"character",
            "text":"character",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"P",
               "headTokenIndex":24
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"NUMBER_UNKNOWN",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"PUNCT",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":".",
            "text":".",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"AUX",
               "headTokenIndex":31
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"GENDER_UNKNOWN",
               "mood":"MOOD_UNKNOWN",
               "person":"PERSON_UNKNOWN",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"VERB",
               "tense":"TENSE_UNKNOWN",
               "case":"CASE_UNKNOWN",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"May",
            "text":"May",
            "beginOffset":-1
         },
         {
            "dependencyEdge":{
               "label":"NSUBJ",
               "headTokenIndex":31
            },
            "partOfSpeech":{
               "voice":"VOICE_UNKNOWN",
               "number":"SINGULAR",
               "form":"FORM_UNKNOWN",
               "gender":"MASCULINE",
               "mood":"MOOD_UNKNOWN",
               "person":"THIRD",
               "aspect":"ASPECT_UNKNOWN",
               "proper":"PROPER_UNKNOWN",
               "tag":"PRON",
               "tense":"TENSE_UNKNOWN",
               "case":"NOMINATIVE",
               "reciprocity":"RECIPROCITY_UNKNOWN"
            },
            "lemma":"he",
            "text":"he",
            "beginOffset":-1
         },
      ],
      "language":"en"
   },
   "raw-input":"James Adams is a fantastic developer who has been on the scene for over ten years. His excellent attitude and steady work ethic contribute to his character. May he stay with our company for many years to come."
}
common properties
here
here
DependencyEdgeLabel
Aspect
Case
Form
Gender
Mood
Number
Person
Proper
Reciprocity
Tag
Tense
Voice