plaintext_analyzer package

Submodules

plaintext_analyzer.entry module

plaintext_analyzer.file_reader module

class plaintext_analyzer.file_reader.PlaintextReader(source, stype, lang)

Bases: object

Read plain text file

text

content of file

Type:

str

property sentences

sentences in file content

Type:

list of str

plaintext_analyzer.file_writer module

class plaintext_analyzer.file_writer.CSVWriter(name)

Bases: object

Write object into csv file.

dstfile

destnation file name with extension

Type:

str

write(content)

Write content into csv

Parameters:

content (list of dict) – content to be written into csv file

plaintext_analyzer.file_writer.write_to_csv(fields, content, csvfile='output.csv')

Write content into csv file

Parameters:
  • fields (list of str) – heads of content

  • content (list of dict) – one item in content is one line in csv file

  • csvfile (str) – csv file path

plaintext_analyzer.file_writer.write_to_json(content, jsonfile='output.json')

Write content into json file

Parameters:
  • content (list of dict) – content to be written into json file

  • jsonfile (str) – json file path

plaintext_analyzer.phrase_analyzer module

class plaintext_analyzer.phrase_analyzer.PhraseAnalyzer(lang)

Bases: object

Analyze phrase of sentences

overview_phrases(sens)

Get phrases with explanation from sentences.

Parameters:

sens (list) – sentences

Returns:

one element contains phrases for each sentence. one element has keys: sentence, noun_phrases, prep_phrases, verb_phrases, verbs

Return type:

list of dict

plaintext_analyzer.structure_kg_analyzer module

class plaintext_analyzer.structure_kg_analyzer.StructureKGAnalyzer(lang)

Bases: object

Analyze sentence structure and kg of sentences

overview_structure_kg(sens)

Get structure and kg with translation from sentences.

Parameters:

sens (list) – sentences

Returns:

one element contains kg and structure for each sentence. one element has keys: sentence, kg, structure

Return type:

list of dict

plaintext_analyzer.structure_kg_analyzer.add_bracket(ex)

plaintext_analyzer.vocab_analyzer module

class plaintext_analyzer.vocab_analyzer.VocabAnalyzer(lang)

Bases: object

Analyze vocabulary of sentences

overview_vocabs(sens, google=False)

Get vocabularies with explanation from sentences, WITHOUT duplicates.

Parameters:

sens (list) – sentences

Returns:

vocabularies

Return type:

list

Module contents