Python Data Analysis Library — pandas: Python Data Analysis Library
Python Data Analysis Library¶
pandas is an open source, BSD-licensed library providing high-performance,
easy-to-use data structures and data analysis tools for the Python programming language.What problem does pandas solve?¶
Python has long been great for data munging and preparation, but less so for
data analysis and modeling. pandas helps fill this gap, enabling you to
carry out your entire data analysis workflow in Python without having to
switch to a more domain specific language like R.Combined with the excellent IPython toolkit and
other libraries, the environment for doing data analysis in Python excels in
performance, productivity, and the ability to collaborate.pandas does not implement significant statistical modeling outside of linear
and panel regression; for this, look to the statsmodels project. More work is still needed to make Python a
first class statistical modeling environment, but we are well on our way
toward that goal.