Data-analysis-with-pandas

pandas for data analysis
Fig. 1

Welcome to the tutorial Data analysis with pandas. In this tutorial i have covered all the topics of pandas and tried to explain each and every concepts with lesser words in jupyter notebook so that you can observe the function of every methods in pandas from the ground level.


First of all let’s understand “what is data analysis and why should we use pandas for analysis ?”.

What is data analysis ?

Suppose you are working in a company which daily generates a lot of data of customers and you are assigned a task to extract some useful information out of it with certain deadline. What will you do if you have very limited time you can not exract information just by looking into the dataset because size of the data is huge. So you asked for help from your collegue he said just read about pandas for data analysis. You study about pandas and you found that pandas makes your life easier than just looking at dataset and finding useful informations.

What does pandas actually do ?

This official documentation says-
pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.

pandas is well suited for many different kinds of data:

Dependencies-

   pip install pandas 
   pip install numpy

Table of contents -

Introduction to pandas :


Fig.2

Pandas is used as data cleaning tool in the field of data science.You can do whatever operation you want in the dataset with this tool.Now question arises, can we clean or change the value in the dataset manually ? Answer is yes we can if size of the dataset is small.What if we have a large dataset then we can not do it manually it will take a lot of time.Pandas makes data science very easy and effective.
To use pandas you need to first import the pandas module in your program

     import pandas as pd 


Reading CSV and Excel sheets:

d=pd.read_csv(“path”):

d=pd.read_excel(“path”) :

For futher tutorial go to the above link given in the Table of contents or click this link

References :


dshahid380 Md Shahid