PLO 6

"Conduct informatics analysis and visualization applied to different real-world fields, such as health science and privacy."

Definition:

Interpreting and displaying data in a manner that caters to specialized environments is valuable. Depending on the area of interest, data insights and knowledge can be harvested and expressed using different methodologies and software. Topics of interest can be explored through analytical means, such as through the use of software applications like Tableau or Microsoft’s Power BI. Programming languages like R and Python can be used to clean and analyze data. While insights discovered through data can seem factual, revelations should be scrutinized according to the validity of the data it comes from.

Data wrangling is also an essential part of the process of informatics analysis and visualization. Researchers must validate the integrity of sources and understand where the data may be faltering and incomplete. Data sources that one domain uses may not be available for other domains. Similarly, data collection technology varies depending on what and how a subject can be studied. In either case, I believe it is essential that informaticians become familiar with the limits that their data possess, so that the inherent truth is presented as accurately as possible.

Description:

#1: Mini Project: Regional Labor Force Data

For this assignment, I described my data wrangling, cleaning, and analysis journey with the labor employment data I used. I explained why I chose the topic, supporting dataset, origins of the dataset, and my experience with wrangling the data. The data gathering process showed me how to decipher when a dataset will be useful for analytical purposes. Additionally, I used Python3 and JupyterLab to clean and access the data I collected. Within Python, I utilized the pandas library to help filter results as well.

Throughout this assignment, I was using a prescribed framework designed to approach data analysis problems from an organized standpoint. This framework simplified the data analysis research to a few steps: have a goal, find appropriate data sets, and use technology as an assistant to augment my effort. It was evident to me that technology is extremely beneficial for munging and filtering large datasets, but I should not center my research around what technology is available to me. Technology that I use for analytical purposes will be for supporting my workflow rather than directing my attention.

Mini-Project: Regional Labor Force Data.pdf
Mini-Project Regional Labor Force Data Report.pdf

#2: Writing Assessment: Stamford Hospital Data Analytics

I reviewed a presentation of Stamford Hospital’s transition from their original data analytics framework to their adoption of Tableau. The original setup was very fragmented and did not offer nearly the same amount of valuable insight that was garnered through the use of Tableau. When incorporating Tableau, the hospital was able to save money, identify areas of improvement, and catalog resource intensive areas. I found that the presentation highlighted the vast improvements offered by Tableau’s analytical capabilities. The presentation I reviewed showed me how beneficial analytics can be in a healthcare setting. What was particularly interesting was the fact that the insight derived from the analytical software influenced multiple departments, from patient care to financial reporting.

Stamford Hospital Data Analytics.pdf

#3: Writing Assessment: Analyzing Healthcare Data Sources

In this briefing, I reviewed data sources in the realm of healthcare. I explain how new technology can obfuscate the value of healthcare data if standardization processes are not in place. Having too many data sources with just as many file types will not produce insights and will only complicate matters further. Once the mountain of data has been analyzed, the insights may already be outdated because by the time the analysis took place, more data was accumulated. Also, faulty collection devices could lead to incorrect insights, therefore ongoing maintenance of data collection devices is necessary.

The sheer amount of unstructured data in healthcare settings is enough to warrant the implementation of software and devices that analyze such data. Machine learning can assist data analysis; however, this novel tool requires a learning period. The training sessions are crucial for machine learning because it sets the foundational knowledge for the algorithms and this period is when researchers must discuss and prevent biases from being passed on. I found that there are many ways to collect data and that this data collected also comes in different formats. This assignment emphasizes my understanding of the different types of data and their limiting factors. Additionally, I consider the significant improvements and benefits that technology can bring about to solve the problem of too much data, especially in the healthcare setting.

Analyzing Healthcare Data Sources.pdf

Discussion:

Data analytics and visualization are useful to various businesses for a wide range of reasons. Analytics can help identify areas of improvement and, subsequently, reduce business costs. These enhancements will lead to better workflows and faster knowledge accumulation. Business intelligence will also witness growth as a result. Software with specific purposes can be used across different industries, but the actual data analytics and collection may be customized according to what a particular industry needs. Nonetheless, insights derived from data analytics and visualization demonstrate the areas of opportunity within these businesses. Data visualization further helps display the advancements a business has made and could make.