AnalyzeSyntax Processor
Part of the GCP Natural Language processor family
This processor analyzes an input string and breaks it down into sentences and tokens (which, in most cases, are words). Each token has a plethora of information about it, including tense, quantity, person, and many more. An extensive list is provided in the Data Output section.
Properties
All of our Natural Language processors also include these common properties.
This processor does not have any unique properties outside of the common ones.
Data Output
Field Name | Data Type | Description |
tokens | array of Token | The list of the tokens parsed by the API |
sentences | array of Sentence | The list of the sentences parsed by the API |
language | string | The language code of the language the input string is in |
Token
Token
Field Name | Data Type | Description |
text | string | The raw text of this token |
lemma | string | The root/bare version of this word. Read more about lemma here. |
beginOffset | int | The number of characters from the beginning of the input string to the beginning of this token |
partOfSpeech | PartOfSpeech | A comprehensive list of all of the syntactical information pertaining to this token, such as gender, quantity, person, etcetera. |
dependencyEdge | DependencyEdge | Represents dependency tree information for this token. Read more here. |
Sentence
Sentence
Field Name | Data Type | Description |
text | string | The raw text of this sentence |
beginOffset | int | The number of characters from the beginning of the input string to the beginning of this sentence |
DependencyEdge
DependencyEdge
Field Name | Data Type | Description |
headTokenIndex | int | Represents the head of the token in the dependency tree |
label | string (DependencyEdgeLabel) | The parse label for the token |
PartOfSpeech
PartOfSpeech
Field Name | Data Type | Description |
aspect | string (Aspect) | The characteristic of a verb that expresses time flow during an event |
case | string (Case) | The grammatical function performed by a noun or pronoun in a phrase, clause, or sentence |
form | string (Form) | Depending on the language, Form can be categorizing different forms of verbs, adjectives, adverbs, etcetera |
gender | string (Gender) | Gender classes of nouns reflected in the behavior of associated words |
mood | string (Mood) | The grammatical feature of verbs, used for showing modality and attitude |
number | string (Number) | Count distinctions |
person | string (Person) | The distinction between the speaker, second person, third person, etcetera |
proper | string (Proper) | Distinguishes if the token is part of a proper name |
reciprocity | string (Reciprocity) | Reciprocal features of a pronoun |
tag | string (Tag) | A shorthand-tag for what part of speech the token is |
tense | string (Tense) | Time reference |
voice | string (Voice) | The relationship between the action that a verb expresses and the participants identified by its arguments |
// This output has been heavily edited for brevity
{
"output":{
"sentences":[
{
"text":"James Adams is a fantastic developer who has been on the scene for over ten years.",
"beginOffset":-1
},
{
"text":"His excellent attitude and steady work ethic contribute to his character.",
"beginOffset":-1
},
{
"text":"May he stay with our company for many years to come.",
"beginOffset":-1
}
],
"tokens":[
{
"dependencyEdge":{
"label":"NN",
"headTokenIndex":1
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER",
"tag":"NOUN",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"James",
"text":"James",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"ATTR",
"headTokenIndex":2
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"NOUN",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"developer",
"text":"developer",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"NSUBJ",
"headTokenIndex":8
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"THIRD",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"PRON",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"who",
"text":"who",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"AUX",
"headTokenIndex":8
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"INDICATIVE",
"person":"THIRD",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"VERB",
"tense":"PRESENT",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"have",
"text":"has",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"RCMOD",
"headTokenIndex":5
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"VERB",
"tense":"PAST",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"be",
"text":"been",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"PREP",
"headTokenIndex":8
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"ADP",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"on",
"text":"on",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"DET",
"headTokenIndex":11
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"DET",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"the",
"text":"the",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"POBJ",
"headTokenIndex":9
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"NOUN",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"scene",
"text":"scene",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"PREP",
"headTokenIndex":8
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"ADP",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"for",
"text":"for",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"ADVMOD",
"headTokenIndex":14
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"ADP",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"over",
"text":"over",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"NUM",
"headTokenIndex":15
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"NUM",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"ten",
"text":"ten",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"POBJ",
"headTokenIndex":12
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"PLURAL",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"NOUN",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"year",
"text":"years",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"P",
"headTokenIndex":2
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"PUNCT",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":".",
"text":".",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"ROOT",
"headTokenIndex":24
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"VERB",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"contribute",
"text":"contribute",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"PREP",
"headTokenIndex":24
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"ADP",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"to",
"text":"to",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"POSS",
"headTokenIndex":27
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"MASCULINE",
"mood":"MOOD_UNKNOWN",
"person":"THIRD",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"PRON",
"tense":"TENSE_UNKNOWN",
"case":"GENITIVE",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"his",
"text":"his",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"POBJ",
"headTokenIndex":25
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"NOUN",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"character",
"text":"character",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"P",
"headTokenIndex":24
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"NUMBER_UNKNOWN",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"PUNCT",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":".",
"text":".",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"AUX",
"headTokenIndex":31
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"GENDER_UNKNOWN",
"mood":"MOOD_UNKNOWN",
"person":"PERSON_UNKNOWN",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"VERB",
"tense":"TENSE_UNKNOWN",
"case":"CASE_UNKNOWN",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"May",
"text":"May",
"beginOffset":-1
},
{
"dependencyEdge":{
"label":"NSUBJ",
"headTokenIndex":31
},
"partOfSpeech":{
"voice":"VOICE_UNKNOWN",
"number":"SINGULAR",
"form":"FORM_UNKNOWN",
"gender":"MASCULINE",
"mood":"MOOD_UNKNOWN",
"person":"THIRD",
"aspect":"ASPECT_UNKNOWN",
"proper":"PROPER_UNKNOWN",
"tag":"PRON",
"tense":"TENSE_UNKNOWN",
"case":"NOMINATIVE",
"reciprocity":"RECIPROCITY_UNKNOWN"
},
"lemma":"he",
"text":"he",
"beginOffset":-1
},
],
"language":"en"
},
"raw-input":"James Adams is a fantastic developer who has been on the scene for over ten years. His excellent attitude and steady work ethic contribute to his character. May he stay with our company for many years to come."
}
Last updated