Week 6: Feature Implementation
Ishan B -
Last week, I discussed feature engineering and how it works, specifically the correlation between different features such as people with “less than a high school diploma” compared to the different districts to see the amount of people with different degrees is directly related to the population or if there are other factors that go into it. In the dataset, there was no column for population of a district, but there was information about people with “less than a high school diploma” and percent of people with “less than a high school diploma”. Using these numbers, I was able to generate the population of the district, which was the first feature I implemented as population of an area is one of the biggest factors when it comes to a business’s choice of where to open.
This week, I am generating more features based off of the different years provided in the dataset. One of the important metrics for a business wishing to open in a new area is the potential growth of the area. The change in population over time is a good metric to analyze this, but other metrics can also be analyzed. One other metric that may be important for certain types of businesses is the change in education level over time. Certain businesses are more likely to get certain demographics of people as customers, so being able to see, for example, the change in amount of people getting bachelor’s degrees could be useful information and could impact the decision of a businesses opening location.
Comments:
All viewpoints are welcome but profane, threatening, disrespectful, or harassing comments will not be tolerated and are subject to moderation up to, and including, full deletion.