Week 3: Programming

Ishan B -

Last week, my research focused on gaining a better understanding of regressions and how they work. This helped narrow my choices of which models to test to linear, polynomial, ridge, and lasso regression. After deciding which type of models to test, the next step in my research is to set up the programming interface. For this project, I will be using Google Colab alongside the pandas and numpy libraries. As for the models, Initially, I am planning to test the open-source regression models from Scikit-learn, and in the upcoming weeks once that is completed, if time permits, I will also test the open-source regression models from other libraries such as Keras, Tensorflow, and PyTorch.

Libraries

Numpy is a python library which allows users to create multi-dimensional arrays and matrices, while also allowing for the capability to perform mathematical operations on them. Numpy is an essential library for programmers planning to perform data science operations in python. Numpy is most useful for numerical data.

Pandas is a python library that has similar functions to Numpy, however it excels in tabular data and performing operations such as data cleaning, and filling missing values.

Scikit-learn, Keras, Tensorflow, and PyTorch are all python libraries that give access to different types of machine learning models such as regressions, random forest, and classification models. Each library generates its models differently, so each model may provide slightly different results when trained on the same dataset, which is why I am planning to implement all of them separately and compare them to analyze which one gave the best results for my use case.

More Posts

Comments:

All viewpoints are welcome but profane, threatening, disrespectful, or harassing comments will not be tolerated and are subject to moderation up to, and including, full deletion.

    christopher_y
    Hello Ishan! Your exploration of different python libraries like Numpy and Panda is really interesting to me. What have you found to be the biggest limitations with python libraries so far?
    camille_bennett
    Wow! Sounds like amazing work, Ishan. How do you get access to these libraries?
    ishan_b
    Thanks for the question Chris! One of the biggest limitations that I've noticed with Python libraries is that since Python itself is one of the slower programming languages (takes the longest time to run programs) when using complicated libraries, it can take a long time for the code to execute.
    ishan_b
    Thanks for the question Ms. Bennett! Python is very well integrated with popular libraries such as NumPy and Pandas, so to integrate them, all you need to do is run "import numpy" and "import pandas" at the top of your code and you can use any of the functions that they have. For more niche libraries, you may need to run "pip install" in the terminal to install the library.

Leave a Reply

Your email address will not be published. Required fields are marked *