cnt.rulebase.rules.interval_based_operations package¶
Submodules¶
cnt.rulebase.rules.interval_based_operations.basic_operation module¶
Collect the unicode codepoint specified by intervals.
-
class
cnt.rulebase.rules.interval_based_operations.basic_operation.
BasicIntervalBasedOperation
(intervals)[source]¶ Bases:
object
-
class
cnt.rulebase.rules.interval_based_operations.basic_operation.
IntervalBasedOperationLabelProcessor
(input_sequence, index_labels_generator, config)[source]¶ Bases:
cnt.rulebase.workflow.basic_workflow.BasicLabelProcessor
-
class
cnt.rulebase.rules.interval_based_operations.basic_operation.
IntervalBasedOperationOutputGenerator
(input_sequence, label_processor_result, config)[source]¶ Bases:
cnt.rulebase.workflow.basic_workflow.BasicOutputGenerator
cnt.rulebase.rules.interval_based_operations.builtin_application module¶
TODO
-
class
cnt.rulebase.rules.interval_based_operations.builtin_application.
BuiltInCollector
[source]¶ Bases:
object
-
chinese_chars
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollector object>¶
-
chinese_chars_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollectorLazy object>¶
-
chinese_sentence_chars
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollector object>¶
-
chinese_sentence_chars_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollectorLazy object>¶
-
delimiters
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollector object>¶
-
delimiters_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollectorLazy object>¶
-
digits
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollector object>¶
-
digits_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollectorLazy object>¶
-
english_chars
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollector object>¶
-
english_chars_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_collector.IntervalBasedCollectorLazy object>¶
-
-
class
cnt.rulebase.rules.interval_based_operations.builtin_application.
BuiltInReplacer
[source]¶ Bases:
object
-
REGISTERED_REPL_KEY
: Dict[str, Callable[Callable[str, str]]] = {'empty': <function BuiltInReplacer.<lambda>>, 'space': <function BuiltInReplacer.<lambda>>, 'tab': <function BuiltInReplacer.<lambda>>}¶
-
chinese_chars
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
chinese_chars_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
chinese_chars_spaced
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
chinese_chars_spaced_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
chinese_chars_spaced_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
chinese_chars_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
delimiters
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
delimiters_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
delimiters_spaced
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
delimiters_spaced_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
delimiters_spaced_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
delimiters_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
digits
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
digits_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
digits_spaced
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
digits_spaced_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
digits_spaced_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
digits_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
english_chars
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
english_chars_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
english_chars_spaced
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacer object>¶
-
english_chars_spaced_lazy
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerLazy object>¶
-
english_chars_spaced_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
english_chars_to_string
= <cnt.rulebase.rules.interval_based_operations.interval_based_replacer.IntervalsCollectionBasedReplacerToString object>¶
-
classmethod
generate_param
(repl_with_intervals_collection)[source]¶ - Return type
Dict
[Callable
[[str
],str
],List
[Tuple
[int
,int
]]]
-
cnt.rulebase.rules.interval_based_operations.interval_based_collector module¶
Collect the unicode codepoint specified by intervals.
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_collector.
IntervalBasedCollector
(intervals)[source]¶ Bases:
cnt.rulebase.rules.interval_based_operations.basic_operation.BasicIntervalBasedOperation
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_collector.
IntervalBasedCollectorLazy
(intervals)[source]¶ Bases:
cnt.rulebase.rules.interval_based_operations.basic_operation.BasicIntervalBasedOperation
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_collector.
IntervalBasedCollectorOutputGenerator
(input_sequence, label_processor_result, config)[source]¶ Bases:
cnt.rulebase.rules.interval_based_operations.interval_based_collector._IntervalBasedCollectorOutputGenerator
cnt.rulebase.rules.interval_based_operations.interval_based_replacer module¶
Replace the unicode codepoint specified by intervals with arbitary strings.
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacer
(replacer_intervals)[source]¶
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacerConfig
(labeler2repl)[source]¶
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacerLabelProcessor
(input_sequence, index_labels_generator, config)[source]¶ Bases:
cnt.rulebase.workflow.basic_workflow.BasicLabelProcessor
-
result
()[source]¶ Label processor could generate any return type. Derived class must override this method.
- Return type
Generator
[Tuple
[int
,Optional
[Type
[IntervalLabeler
]]],None
,None
]
-
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacerLazy
(replacer_intervals)[source]¶
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacerOperation
(replacer_intervals)[source]¶ Bases:
cnt.rulebase.rules.interval_based_operations.basic_operation.IntervalsCollectionBasedOperation
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacerOutputGenerator
(input_sequence, label_processor_result, config)[source]¶ Bases:
cnt.rulebase.rules.interval_based_operations.interval_based_replacer._IntervalsCollectionBasedReplacerOutputGenerator
-
class
cnt.rulebase.rules.interval_based_operations.interval_based_replacer.
IntervalsCollectionBasedReplacerOutputGeneratorLazy
(input_sequence, label_processor_result, config)[source]¶ Bases:
cnt.rulebase.rules.interval_based_operations.interval_based_replacer._IntervalsCollectionBasedReplacerOutputGenerator
Module contents¶
Collect or replace any unicode codepoint intervals.