nltk.toolbox.ToolboxSettings¶
- class nltk.toolbox.ToolboxSettings[source]¶
Bases:
StandardFormatThis class is the base class for settings files.
- parse(encoding=None, errors='strict', **kwargs)[source]¶
Return the contents of toolbox settings file with a nested structure.
- Parameters
encoding (str) – encoding used by settings file
errors (str) – Error handling scheme for codec. Same as
decode()builtin method.kwargs (dict) – Keyword arguments passed to
StandardFormat.fields()
- Return type
ElementTree._ElementInterface
- fields(strip=True, unwrap=True, encoding=None, errors='strict', unicode_fields=None)[source]¶
Return an iterator that returns the next field in a
(marker, value)tuple, wheremarkerandvalueare unicode strings if anencodingwas specified in thefields()method. Otherwise they are non-unicode strings.- Parameters
strip (bool) – strip trailing whitespace from the last line of each field
unwrap (bool) – Convert newlines in a field to spaces.
encoding (str or None) – Name of an encoding to use. If it is specified then the
fields()method returns unicode strings rather than non unicode strings.errors (str) – Error handling scheme for codec. Same as the
decode()builtin string method.unicode_fields (sequence) – Set of marker names whose values are UTF-8 encoded. Ignored if encoding is None. If the whole file is UTF-8 encoded set
encoding='utf8'and leaveunicode_fieldswith its default value of None.
- Return type
iter(tuple(str, str))
- open(sfm_file)[source]¶
Open a standard format marker file for sequential reading.
- Parameters
sfm_file (str) – name of the standard format marker input file