Preprocessing, classification, and automatic summarization of emails

Melusine is a high-level Python library for email classification and feature extraction, capable of running on top of scikit-learn, Keras, or TensorFlow. It is designed for the preprocessing, classification, and automatic summarization of emails written in French.

Features

  • Email preprocessing Toolbox
  • Custom email preprocessing Pipeline
  • NLP Toolbox
  • Phraser and Tokenizer Pipeline
  • Embeddings training
  • Metadata Pipeline
  • Summarization by keyword extraction
  • Customizable JSON config file for preprocessing
  • Custom names file to discard or flag names according to their relevance
  • Classification Toolbox
  • Convolutional networks and Recurrent networks
  • Runs seamlessly on CPU and GPU
  • Semantic detector for sentiment score prediction and topic search

Easy preprocessing

Melusine provides a TransformerScheduler class for building your own transformer and integrating it into a scikit-learn Pipeline.
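The idea of wrapping preprocessing functions in a transformer and chaining them in a scikit-learn Pipeline can be sketched as follows. This is a minimal illustration in plain scikit-learn, not Melusine's actual API: the FunctionScheduler class, the clean_body and count_tokens functions, the dictionary keys, and the sample email are all hypothetical stand-ins chosen for this example.

```python
import re

from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.pipeline import Pipeline


class FunctionScheduler(BaseEstimator, TransformerMixin):
    """Hypothetical stand-in for a scheduler-style transformer.

    Applies a list of row-level functions in order and stores each
    result under the given output key.
    """

    def __init__(self, functions):
        # List of (function, output_key) pairs, stored unchanged so that
        # scikit-learn's get_params() works as expected.
        self.functions = functions

    def fit(self, X, y=None):
        # Stateless transformer: there is nothing to learn.
        return self

    def transform(self, X):
        # Copy each row so the caller's data is left untouched.
        rows = [dict(row) for row in X]
        for func, key in self.functions:
            for row in rows:
                row[key] = func(row)
        return rows


def clean_body(row):
    # Collapse runs of whitespace, trim, and lowercase the email body.
    return re.sub(r"\s+", " ", row["body"]).strip().lower()


def count_tokens(row):
    # Count whitespace-separated tokens in the cleaned body.
    return len(row["clean_body"].split())


pipeline = Pipeline([
    ("cleaning", FunctionScheduler([(clean_body, "clean_body")])),
    ("metadata", FunctionScheduler([(count_tokens, "n_tokens")])),
])

# Hypothetical sample email, represented as a plain dictionary.
emails = [{"body": "Bonjour,\n  Merci   pour votre RETOUR. "}]
result = pipeline.fit_transform(emails)
```

Because each step exposes fit and transform, steps can be reordered, removed, or combined with any other scikit-learn compatible transformer without changing the functions themselves.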


High adaptability

You are free to train your own classification and automatic summarization models according to the constraints of your problem.