Source
DataLA: Information, Insights, and Analysis from the City of Angels | Los Angeles - Open Data Portal
Data Set
Crime Data from 2020 to Present | Los Angeles - Open Data Portal
Overview of Data Refinement
This section outlines key steps undertaken to enhance the clarity, efficiency, and overall utility of the dataset.
Elimination of Redundant Columns
To streamline the dataset, certain columns were removed due to duplication or lack of added value:
- Premise Code: Removed, as 'Premise Description' provides comprehensive details, making coded references unnecessary.
- Status Code: Omitted in favor of 'Status Description', which offers more transparent and descriptive status information.
- Area Code: Deleted because 'Area Name' provides a clearer representation of location data.
- Weapon Code: Excluded since 'Weapon Description' provides a detailed account of weapons involved.
- Part 1-2 Column: Removed because it contributes little information to the analysis.
- MO Codes: Separated from the main dataset so that crime patterns and modus operandi can be investigated in more detail.
- Crime Code 1: Eliminated to prevent duplication with 'Crime Code', facilitating easier data interpretation.
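The column removals above can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the script actually used: the column names ('Premis Cd', 'Status', 'AREA', 'Weapon Used Cd', 'Part 1-2', 'Crm Cd 1', 'Mocodes', 'DR_NO') are assumptions about how the portal export labels these fields, and keeping the MO codes in a separate list keyed by record number is one possible reading of "separated".

```python
# Assumed column names; adjust them to match the headers in your export.
REDUNDANT_COLUMNS = {
    "Premis Cd",       # covered by the premise description
    "Status",          # covered by the status description
    "AREA",            # covered by the area name
    "Weapon Used Cd",  # covered by the weapon description
    "Part 1-2",        # judged uninformative for this analysis
    "Crm Cd 1",        # duplicates the primary crime code
}
MO_COLUMN = "Mocodes"  # assumed name of the MO codes column
KEY_COLUMN = "DR_NO"   # assumed unique record identifier


def drop_redundant(rows):
    """Split rows (a list of dicts) into cleaned rows and a separate MO-code list."""
    cleaned, mo_rows = [], []
    for row in rows:
        # Keep MO codes apart, keyed by record id, so they can be studied separately.
        mo_rows.append({KEY_COLUMN: row.get(KEY_COLUMN), MO_COLUMN: row.get(MO_COLUMN, "")})
        # Copy every column that is neither redundant nor the MO column.
        cleaned.append({
            name: value
            for name, value in row.items()
            if name not in REDUNDANT_COLUMNS and name != MO_COLUMN
        })
    return cleaned, mo_rows
```

Rows read with csv.DictReader can be passed to drop_redundant directly; columns missing from a given file are simply ignored.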
Formatting Improvements
Several formatting enhancements were made to boost data consistency and readability:
- Column Renaming: 'CRM CD' has been updated to 'Primary CRM CD' for clearer identification.
- Time Format Standardization: Converted from 24-hour to 12-hour (AM/PM) notation to match everyday time-reading conventions.
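The two formatting steps above could look roughly like the following. This is a hedged sketch: it assumes times are stored as 24-hour values such as 2130 or '0930' (optionally with a colon), and the helper names to_12_hour and rename_column are hypothetical, introduced only for illustration.

```python
def to_12_hour(value):
    """Convert a 24-hour time such as 2130, '0930', or '21:30' to 12-hour text."""
    # Assumed input format: HHMM, possibly missing leading zeros (e.g. 5 means 00:05).
    text = str(value).strip().replace(":", "").zfill(4)
    hour, minute = int(text[:2]), int(text[2:])
    if not (0 <= hour <= 23 and 0 <= minute <= 59):
        raise ValueError(f"not a valid 24-hour time: {value!r}")
    suffix = "AM" if hour < 12 else "PM"
    hour12 = hour % 12 or 12  # 0 and 12 both display as 12
    return f"{hour12}:{minute:02d} {suffix}"


def rename_column(row, old="CRM CD", new="Primary CRM CD"):
    """Return a copy of row (a dict) with the column `old` renamed to `new`."""
    return {(new if name == old else name): value for name, value in row.items()}
```

For example, to_12_hour(2130) gives '9:30 PM' and to_12_hour('0005') gives '12:05 AM'. The suffix is computed by hand rather than with strftime so the output does not depend on the system locale.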