Tanagra (machine learning)

From Wikipedia, the free encyclopedia

Developer(s): Lumière University Lyon 2
Stable release: 1.4.50 / 2013/12/18
Operating system: Windows
Type: Machine learning, data mining, multivariate analysis, data analysis
License: Open source

Tanagra is a free suite of machine learning software for research and academic purposes, developed by Ricco Rakotomalala at the Lumière University Lyon 2, France.[1] It supports several standard data mining tasks, such as visualization, descriptive statistics, instance selection, feature selection, feature construction, regression, factor analysis, clustering, classification and association rule learning.

Tanagra is an academic project. It is widely used in French-speaking universities,[2] and it appears both in applied studies[3] and in software comparison papers.[citation needed]


Development of Tanagra started in June 2003, and the first version was distributed in December 2003. Tanagra is the successor to Sipina, another free data mining tool, which is intended only for supervised learning (classification) and in particular for the interactive, visual construction of decision trees. Sipina is still available online and is maintained. Tanagra is an "open source project": any researcher can access the source code and add their own algorithms, provided they accept and comply with the software distribution license.

The main purpose of the Tanagra project is to give researchers and students user-friendly data mining software that conforms to current norms of software development in this domain (especially in the design of its GUI and the way it is used) and that allows the analysis of either real or synthetic data.

Since 2006, Ricco Rakotomalala has put substantial effort into documentation. A large number of tutorials are published on a dedicated website. They describe statistical and machine learning methods and their implementation with Tanagra on real case studies. The use of other free data mining tools on the same problems is also described at length. Comparing the tools helps readers understand the differences in how each presents its results.


A screenshot of Tanagra software

Tanagra works similarly to current data mining tools. The user visually designs a data mining process as a diagram. Each node is a statistical or machine learning technique, and the connection between two nodes represents a data transfer. Unlike most tools, which are based on the workflow paradigm, Tanagra uses a deliberately simplified model: the treatments are represented in a tree diagram. The results are displayed in HTML format, which makes it easy to export the outputs and view them in a browser. It is also possible to copy the result tables to a spreadsheet.
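
The tree-diagram model described above can be illustrated with a small Python sketch. This is not Tanagra's source code or API: the names Node, transform, report and run are hypothetical and chosen only for illustration. Each node applies one treatment to the data it receives from its parent, passes the result to its children, and contributes a fragment to a nested HTML report.

```python
from dataclasses import dataclass, field
from html import escape
from statistics import mean
from typing import Callable, Dict, List

# Hypothetical data model: column name -> list of numeric values.
Dataset = Dict[str, List[float]]


@dataclass
class Node:
    """One treatment in the tree (illustrative only, not Tanagra's design)."""
    title: str
    transform: Callable[[Dataset], Dataset]  # produces the data passed to children
    report: Callable[[Dataset], str]         # plain-text summary of this node's output
    children: List["Node"] = field(default_factory=list)

    def add(self, child: "Node") -> "Node":
        """Attach a child treatment and return it, so branches can be chained."""
        self.children.append(child)
        return child

    def run(self, data: Dataset) -> str:
        """Run this node, then its children, and return a nested HTML list item."""
        out = self.transform(data)
        parts = [f"<li><b>{escape(self.title)}</b>: {escape(self.report(out))}"]
        if self.children:
            parts.append("<ul>" + "".join(c.run(out) for c in self.children) + "</ul>")
        parts.append("</li>")
        return "".join(parts)


def select_columns(names: List[str]) -> Callable[[Dataset], Dataset]:
    """Build a treatment that keeps only the named columns."""
    return lambda d: {k: d[k] for k in names}


def describe(d: Dataset) -> str:
    """Summarize each column by its mean."""
    return ", ".join(f"{k} mean={mean(v):.2f}" for k, v in d.items())


# Build a three-node tree: dataset -> column selection -> descriptive statistics.
root = Node("Dataset", lambda d: d, lambda d: f"{len(d)} columns")
view = root.add(Node("Select x", select_columns(["x"]), lambda d: "kept " + ", ".join(d)))
view.add(Node("Descriptive statistics", lambda d: d, describe))

html_report = "<ul>" + root.run({"x": [1.0, 2.0, 3.0], "y": [4.0, 5.0, 6.0]}) + "</ul>"
print(html_report)
```

Because every node has exactly one parent, data flows strictly downward and the report nests in the same shape as the diagram. This is the simplification relative to a general workflow graph, in which a node may receive inputs from several upstream nodes.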

Tanagra strikes a balance among statistical approaches (e.g. parametric and nonparametric statistical tests), multivariate analysis methods (e.g. factor analysis, correspondence analysis, cluster analysis, regression) and machine learning techniques (e.g. neural networks, support vector machines, decision trees, random forests).


  1. R. Rakotomalala, "TANAGRA: a free software for research and academic purposes", in EGC'2005, 2005.
  2. G. Gregoire, F.X. Jollois, J.F. Petiot, A. Qannari, S. Sabourin, P. Swertwaegher, J.C. Turlot, V. Vandewalle, S. Viguier-Pla, "Software and statistics teaching in STID department of IUT", in Statistique et Enseignement, 2(2), 5-24, 2011.
  3. E. Kirkos, C. Spathis, A. Nanopoulos, Y. Manolopoulos, "Identifying Qualified Auditor's Opinions: A Data Mining Approach", in Journal of Emerging Technologies in Accounting, 4(1), 183-197, 2007.