DiscoverDataScience.org

  • Online
    • Online Masters in Business Analytics
    • Online Masters in Data Analytics
    • Online Masters in Data Science
    • Online Masters in Health Informatics
    • Online Masters in Information Systems
    • Top Affordable Online Master’s in Data Science
  • Programs
        • Bachelors in Data Science
        • Minor in Data Science
        • Masters in Data Science
        • MBA in Data Science / Data Analytics
        • Data Science PhD Programs
        • Additional Programs
        • Data Science Bootcamps
        • Data Science Certificate Programs
        • Associates Degree in Data Science
  • Related Programs
        • Masters in Business Analytics Programs
        • Masters in Data Analytics Programs
        • Masters in Health Informatics Programs
        • Masters in Information Systems Programs
        • PhD in Health Informatics
        • PhD in Information Systems
        • Other Degrees and Certificate Programs
        • Accounting Analytics
        • Actuarial Science
        • Cyber Security
        • Data Analytics and Visualization
        • Geographic Information Systems (GIS)
        • Sports Analytics
  • Schools By State
    • California
    • Florida
    • Georgia
    • Maryland
    • New Jersey
    • New York
    • Pennsylvania
    • Texas
    • Virginia
    • All Schools by State
  • Careers & Salary
        • Career Guides – How to Become:
        • Business Analyst
        • Business Intelligence Analyst
        • Data Analyst
        • Data Scientist
        • Machine Learning Engineer
        • Statistician
        • All Career Guides
        • Salary Guides
        • Careers in Data Science
        • Business Analyst
        • Data Analyst
        • Data Scientist
  • Resources
        • Articles
        • Data Science in the Health Care Industry
        • Data Storytelling
        • How to Use Deepfake
        • Journey through Data Science with the Data Professor
        • Top Reasons to Become a Data Scientist
        • What is Python and Why Important
        • + All Articles
        • FAQ
        • Data Analyst vs Data Scientist
        • Data Science vs Computer Science
        • Do You Need a PhD to Become a Data Scientist?
        • How to Get a Job as a Data Scientist?
        • Is Data Science Hard?
        • Is a PhD in Data Science Worth It?
        • What Can I Do With a Masters in Statistics?
        • What is Business Analytics?
        • What is Data Analytics?
        • +All FAQs
        • Social Good
        • Clean Water
        • Cyberbullying
        • Mental Health
        • Nonprofits
        • +All Social Good
        • Data Science in Industry
        • Artificial Intelligence AI
        • Biotechnology
        • Clean Energy
        • Health Care
        • Logistics
        • Marketing
        • Sports
        • + All Industries
        • Data Science Training Toolkits
        • Java
        • SAS
        • SQL
        • Tableau
        • +All Training
        • More Resources & Helpfull Guides
        • Data Science and Sustainability
        • Expert Interviews
        • Exploring a Career with Numbers
        • Income Sharing Agreements
        • Making Room for Diverse Populations in STEM
        • Scholarship Guide
        • +More Resources
        • Top Picks
        • Best Master’s Data Science Programs for 2023
        • Best Bachelor’s Data Science Programs for 2023
        • The Most Affordable Data Science Bachelor’s Programs for 2023
        • The Most Affordable Data Science Master’s Programs for 2023
FIND A PROGRAM
1
2
3
4
Sponsored Content

Data Science for Law Enforcement

By Kat Campise, Data Scientist, Ph.D.

While data science is generally associated with tooling machine learning algorithms and making precise predictions or helping to design new digital (or non-digital) products for the likes of Google, Facebook, Apple, and Amazon, data scientists can make a positive impact specifically for social good.

FIND SCHOOLS
Sponsored Content

Crime is a ubiquitous problem that penetrates a society regardless of the country or culture. Whether an individual or group is focused on stealing tangible property, an individual’s identity or seeking to harm others physically, we live in an era where just about everything we do is recorded and stored on numerous servers throughout the world. As such, a treasure trove of data is available to law enforcement agencies — that is if the social media giants and other data collection organizations (including individuals who may record a crime via their phones) agree to share the data.

Featured Programs:
Sponsored School(s)
Southern New Hampshire University Logo
Southern New Hampshire University
Featured Program: AS, BS and MS Data Analytics
Request Info
UC Berkeley Logo
UC Berkeley
Featured Program: UC Berkeley’s Master of Information and Data Science | Online
Request Info
George Mason University Logo
George Mason University
Featured Program: MS in Data Analytics Engineering and Certificate in Data Analytics
Request Info
Grand Canyon University Logo
Grand Canyon University
Featured Program: Online Technology Master's Degree Programs in the following career paths: IT Project Manager, Information Technology Manager, Database Administrator, Computer Systems Analyst and many more.
Request Info
Purdue Global Logo
Purdue Global
Featured Program: Associate of Applied Science in Information Technology - Data Analytics; Master of Science in Information Technology - Data Analytics; Professional Focus + Google Data Analytics Certificate
Request Info
Arizona State University - Online Logo
Arizona State University - Online
Featured Program: Online Bachelor of Science in Data Science
Request Info
University of Virginia Logo
University of Virginia
Featured Program: A top-tier master's in data science designed for working professionals
Request Info

In the U.S., the incidences of violent and property crime have steadily decreased since 1990. Per the U.N.’s 2017 report on world crime, similar decreases in crime have also been evident on a global level over the last 17 years. Given that criminals use the same (or similar) tools as non criminals, e.g., smartphones, social media, etc., and machine learning/AI algorithms are becoming “smarter” in their nuanced detection of the likelihood of fraudulent or criminal activities, the downshift in crime statistics shouldn’t be all too surprising.

But, there is definitely more that can be done by using data science tools and processes for identifying an actual threat before the criminal (or criminals) can carry out their plan.

Imperfection of Data Science as Digital Detective Work

Assessing the fine line between an empty threat on Twitter and determining that an outrageously inflammatory threat is the real deal isn’t an easy task. This is compounded by the fact that machine learning algorithms (and potentially, AI algorithms) are susceptible to the biases of those who are creating them. Granted, these biases may not be intentional. But, anyone entering the field of data science needs to be continually aware that, at the end of the day, we are still operating from a filtered perceptual view of the world.

Machine learning and AI operate based on the quality and amount of data fed into them as well as the underlying assumptions of the scientist (or programmer) who is constructing the algorithms. Thus, at every step of the data science process, you must perpetually self-analyze and test your own internal assumptions about what you’re doing in addition to meeting the statistical assumptions for the particular statistical model you’re about to build and deploy.

A recent example of unintended bias is the Microsoft experiment in 2016 where “a chatbot plugged into Twitter famously created a racist machine that switched from tweeting that “humans are super cool” to praising Hitler and spewing out misogynistic remarks.”

Yet another example of predictive algorithm imperfection is the 2014 case where a risk assessment score identified an African American female as having an increased likelihood of being a repeat offender; however, a Caucasian man who actually had been convicted of prior felonies (unlike the aforementioned female) was rated as a lower risk for repeat offenses. There was another twist here: the Caucasian man continued to commit crimes whereas the African American female did not.

The message here is clear: algorithms are a tool for decision making, they are not pure mathematical models that can be left unattended after deployment, nor should they be wholly relied upon. Notably, as AI becomes the go-to method for an increasing number of systems, there is another concern that surrounds the expanding complexity of machine learning and AI algorithms.

Ultimately, AI will develop its own heuristics — or shortcuts for decision making –, and it’s likely this will create a black box scenario whereby “it will become increasingly difficult for even the engineers who created an AI system to explain the choices it made.” As such, it’s imperative that data scientists maintain an awareness of this likelihood.

We do not yet know precisely what outcomes AI will produce, and the data we have regarding individuals is wrought with context; it’s also not perfectly clean nor is it, by any means, entirely complete. Thus, data scientists in law enforcement should, ideally, have knowledge and experience in the world of law enforcement techniques and training; this is in addition to the in-depth math, statistics, and programming skills required for accurate prediction modeling.

Data Science Crime Detection Tools

In the past, law enforcement agencies have been a fragmented data sharing matrix. The situation has improved through the installation of national databases, such as the Federal Bureau of Investigation’s National Crime Information Center, crime mapping software, internal organizational management tools, e.g., CompStat, and OneDOJ — to name just a few.

Law enforcement data scientists might also be tasked with culling information from social media channels and merging that information with data from their internal database; this may or may not require data scraping experience depending on whether they already have someone else dedicated to culling this data.

The caution here is to ensure that you’re building an accurate crime detection profile and maintaining the discernment between the varying degrees of data representing circumstantial evidence (people will run a Google search or post odd things to social media that may or may not be expressions of criminal intent) or pointing towards a resolution to commit a crime.

Also, as frequently stated, not all data points may be available, e.g., text analytics of interview transcripts containing victim or witness statements.

Natural Language Processing and Law Enforcement

This brings us to another area of expertise that law enforcement data scientists should have: natural language processing.

While our handy machines (computers) are wondrous at massive computational tasks, human language, in all of its contextualized diversity and perceptual translation, isn’t readily decomposed into the binary realm of ones and zeros. Human beings are still far more masterful in this area. For example, a deep learning algorithm was programmed to write Burger King advertisements. The algorithm in question arrived at a veritable word salad: Gender reveal bad. Tender reveal young. It is a boy bird with crispy chicken tenders from Burger Thing. So, AI is still not yet ready to replace human writers (or readers) as it doesn’t currently have the capacity to understand the meaning of what it’s producing.

Taken a step further, words (sentences and paragraphs) also have meaning to the reader, which has a high likelihood of being perceived differently than the writer intended.

Summarily, human language has a compendium of moving parts that statistical algorithms need to take into consideration, and this isn’t easy considering math is its own language. Law enforcement isn’t merely a static collection of data points. Accurate assessment of crime requires interpreting eyewitness accounts, 911 dispatch recordings, and police body cam footage (more specifically, trying to understand conversations that can’t be clearly heard while reviewing the footage). While these can be recorded and AI can transcribe them, as well as analyzed for emotional distress and inflection points (this is an area of continued research), the human element is still an important aspect for accurately interpreting what the report is likely to determine.

Areas of study within natural language processing include computational linguistics and computational psychology.

Digital Evidence Management: Body Cams

Video evidence isn’t always easy to analyze.

Everyone (or almost everyone) has a phone-based camera, and body cams are steadily being required for police officers throughout the U.S. Hours upon hours of footage is often reviewed during an investigation, and the video quality may not be ideal due to inclement weather or body cam malfunctioning.

Data scientists can assist in shortening video review time by utilizing AI to designate “a zone in the video frame, where any movement causes an alert to generate.” Once the alert is established, the frame can then be tagged for further analysis.

Facial recognition is another area where data scientists can help law enforcement to identify criminals who have not yet been detained on outstanding warrants.

Although body cam AI is still in the research and refinement stage, facial recognition algorithms can be used to immediately notify police officers if they are in the presence of a perpetrator who has a current warrant out for their arrest.

Data Science Algorithms for Crime Detection

It should be clear by now that algorithmic accuracy is largely dependent on the skill of the statistician or data scientist and the quality of the data.

There are a plethora of statistical models available for use within law enforcement. The list below is not exhaustive; it’s a starting point rather than a comprehensive detailing of all possible models that are relevant to crime detection. Furthermore, only general descriptions are included as an extensive analysis of when, how, and why each statistical tool is beyond the scope of this article.

  • Logistic regression: This is one of the most utilized techniques in machine learning (and data science). It’s a relatively straightforward binary classification tool; however, it can expand in complexity. Logistic regression may be used as the model for repeat offender risk scores.
  • Clustering algorithms: In terms of crime, cluster analyses can also be used in risk scoring as well as predicting which neighborhoods will have a higher likelihood of increases or decreases in crime rates. There are different types of clustering techniques including hierarchical, centroid-based, distribution-based, and density-based. One or more of the clustering tools can be used contingent on the type of data and the goal of the analysis.
  • Convolutional Neural Networks: Many machine learning and AI models have been patterned after biological frameworks, and CNN falls into this classification. Basically, the construct of this model simulates how the visual cortex operates via neuronal-type nodes which constitute layers between the input and output values. The CNN technique is frequently used for image and video analysis.
  • Convolutional Deep Belief Networks: Although CDBNs are also used in video analysis, they can be useful for analyzing audio as well. This is particularly the case when high dimensional data is the input value (or values) for analysis. CBNs are comprised of multiple layers with one being visible and the rest are hidden. Classifying the probability of who is speaking and what is being said on audio recordings is one (but not the only) way that CDBNs can be useful in data science for law enforcement.
  • Recurrent Neural Networks: RNNs have been used for speech and text (or handwriting) recognition in the past. Within the RNN classification, Long-short term memory (LTSM) networks have also been used for speech recognition and speech-to-text tasks. Emergency call analysis and reconstructing other types of audio transcripts, such as witness and victim interviews, are the possible uses of RNNs in law enforcement.

Conclusion

Research into increasing predictive accuracy is ongoing. Academics, industry-specific machine learning engineers, data scientists, and AI enthusiasts continue on their quest to advance the machine learning, deep learning, and AI tools for a variety of objectives.

Beyond creating a better “virtual assistant” lies an area of application where algorithms and informed human decision-making can, and should, operate hand in hand.

We shouldn’t over-rely on either component. Contrary to the puritanical belief that algorithms are devoid of the biases inherent in human perception, we’ve witnessed that this is, in fact, untrue. The tools we use for prediction are susceptible to confirmation bias.

It can’t be overstated that, ultimately, algorithmic constructs are created by humans for humans — whether to improve a consumer shopping experience or to quickly identify and prevent criminal activity. Needless to say, humans are prone to error.

FIND SCHOOLS
Sponsored Content
FIND A PROGRAM
1
2
3
4
Sponsored Content
  • Career Guides
  • Artificial Intelligence Engineer
  • Business Analyst
  • Business Intelligence Analyst
  • Data Analyst
  • Data Analytics Manager
  • Data Architect
  • Data Engineer
  • Data Mining Specialist
  • Database Administrator
  • Database Developer
  • Information Security Analyst
  • Machine Learning Engineer
  • Marketing Analyst
  • Software Developer
  • Statistician
  • Data Science Toolkit
  • Hadoop
  • Hive
  • Java
  • Python
  • R
  • SAS
  • SQL
  • Tableau
  • Data Science Articles
  • 10 Data Science Types
  • AI and Data Science
  • The Increasing Importance of Health Informatics
  • Python Growth Rate Predictions
  • Data-as-a-Service (DaaS)
  • Data Science Trends 2023
  • Cybersecurity Analyst vs. Engineer
  • Data Science in Education
  • Do You Need a PhD to Become a Data Scientist?
  • Best Big Data Conferences 2023
  • Data Science Focus Areas
  • Is a PhD in Data Science Worth It?
  • Is Data Science Hard?
  • Marketing Analytics Degree Online
  • Transferable Data Science Skills
  • Transitioning to Data Science
  • What Can I Do With a Masters in Statistics?
  • What Companies Hire Data Scientists?
  • What Is Cyber Science?
  • How to Read Crypto Charts
  • Breaking Down the Top Data Science Algorithms + Methods
  • Journey through Data Science with the Data Professor
  • How to Build a Data Science Portfolio & Resume
  • The Significance of Data Community Building
  • Developer Impostor Syndrome
  • How to Improve Programming Skills
  • Data Science Degree Vs. Training
  • Why Data Destruction is Important for your Business
  • Data Storytelling: Mastering Data Science’s Core Skillset
  • What is a Marketing Funnel and How to Create One
  • Building a Data Science Brand
  • Interviewing for Data Careers
  • Top 5 Reasons to Become a Data Scientist
  • What is Data Analytics?
  • What is Business Analytics?
  • What is Quantum Machine Learning?
  • What is Predictive Analytics?
  • Data Science vs. Statistics
  • Data Mining vs. Machine Learning
  • Business Analyst vs. Data Scientist
  • Data Scientist vs. Software Engineer
  • Data Science vs. Computer Science
  • Data Engineer vs. Data Scientist
  • Data Analyst vs. Data Scientist
  • How to Use Deepfake Technology
  • Java vs. JavaScript
  • What Is Python Used For & Why Is It Important to Learn?
  • Artificial Intelligence as a Trending Field
  • Data Science in Health Care
  • Guide to a Career in Criminal Intelligence
  • Guide to a Career in Health Informatics
  • Guide to Geographic Information System (GIS) Careers
  • Data Science Ph.D.
  • Expert Interview: Dr. Sudipta Dasmohapatra
  • Expert Interview: Sandra Altman
  • Expert Interview: Tony Johnson
  • Expert Interview: Bob Muenchen
  • Industries Using Data Science
  • Artificial Intelligence
  • Biotechnology
  • Finance
  • Health Care
  • Insurance
  • Law Enforcement
  • Logistics
  • Marketing and Advertising
  • Sports
  • Clean Energy
  • Online Guides
  • Data Science
  • Data Analytics
  • Business Analytics
  • Information Systems
  • Health Informatics
  • Programs
  • Online
  • Resources
  • Related Programs

© Copyright 2025 | https://www.discoverdatascience.org | All Rights Reserved

  • Home
  • About Us
  • Privacy Policy
  • Terms of Use