Preparing for the future of medicine: Considering the need for data-literate physicians

ABSTRACT: Artificial intelligence will undoubtedly change the landscape of medical practice and give rise to new possibilities for patient care. Today’s physicians are in a unique position to lead this era of technological integration and innovation. Large amounts of patient data currently being collected will be leveraged to provide clinical insight into personalized health care on a global scale. However, there remain many barriers to wide-scale implementation of artificial intelligence tools, including our historic practices of collecting and documenting patient data, as well as the organization of our health care system. Furthermore, a lack of training has physicians underprepared to reap the full potential of these tools and safeguard against adverse consequences for patient care. Given the exciting opportunities for the use of artificial intelligence in medicine, ways to overcome associated challenges must be found.

The revolution is here: Artificial intelligence and machine learning are being applied to medicine, though there are considerable challenges ahead.

In 2014 the BC Ministry of Health’s report, “Setting Priorities for the BC Health System,” outlined information management and technology as one of seven strategies to enable efforts toward creating health system change. One subcomponent of this strategy is to build BC’s “informatics capacity to use data to enhance decision making and improve outcomes at all levels of the system.”[1,2] Thus far, the medical community has started to adopt digitization of health information in practice to better collect, communicate, and store medical data.[3] In 2016, the Canadian Medical Association (CMA) reported that 75% of general practitioners use electronic medical records (EMRs) to enter or retrieve patient notes, an increase from 26% in 2007.[4] More recently, the role of artificial intelligence (AI) in medicine has surfaced.[5] AI is a field that involves computers using data to simulate human cognition, including our ability to reason, discover meaning, generalize, and learn from experiences in order to make informed decisions.[3] As we move into the intelligence age of medicine, physicians will have to adapt in order to harness the true potential and benefits of AI for patients.

Benefits of AI integration in medicine

Machine learning (ML), a subfield of AI, has recently received attention for its potential impact on medical practice. ML employs algorithms that can learn patterns from large data sets without being explicitly programmed, and can be equipped with self-correcting tools to improve accuracy with exposure to new data.[6] For example, ML is widely used by online retailers to identify additional products a shopper may be interested in based on their previous purchases.[6] In medicine, ML maps structured data, in the form of patient input variables (e.g., imaging, genetic markers, laboratory results, a patient’s age, gender, symptoms, or medications) to an output (outcome of interest), providing a pattern that can be used to make predictions in future cases. To understand how this is playing out in practice to enhance personalized medicine,[7] we look at a few applications of AI in relation to breast cancer.


In 2007, the FDA approved MammaPrint, which uses the weighted average of 70 measured genes in patients with early-stage breast cancer to predict 10-year recurrence risk. It is being used clinically by oncologists to select patients for adjuvant chemotherapy treatment.[8]


In 2018, the FDA approved Transpara, which identifies soft tissue and calcification lesions on mammograms to generate a cancer suspiciousness score. It is being used clinically to improve the decision-making accuracy of radiologists while reading scans.[9]


IBM’s Watson for Oncology accesses over 300 medical journals and textbooks to “read” updated literature, guidelines, over 550 breast cancer cases, as well as patient characteristics, medical history, imaging, and laboratory findings to develop patient-specific breast cancer treatment recommendations. It suggests multiple options for each patient.[10]

ML has the potential to significantly assist physicians in proactive decision making. While AI is predicted to first be incorporated into fields that favor structured data such as radiology, dermatology, and pathology, it is relevant to numerous clinical specialties.[11] Not only can it augment clinical decision making, it can also help with logistic operations, clinic performance analysis, continuing education, professional development, and population health.[4] Furthermore, the use of natural language processing is another subcomponent of AI that deals with transforming unstructured data in the form of clinical notes, patient interviews, or medical journals into structured data that can be analyzed by ML, heightening the utility in qualitative aspects of medicine.[12] In order to harness the utility of AI in physicians’ day-to-day practices, barriers to integration must first be addressed, and physicians must be taught how to evaluate and use such tools.

Challenges of AI integration in medicine

Data availability

The successful integration of AI into medicine will depend largely on the quantity and quality of accessible electronic data. The data are required to train and test ML models, whether diagnostic, treatment, or administrative. The current structure of the health care system makes obtaining this data challenging. The required data are currently stored in isolated data silos used by pharmacies, laboratories, hospitals, clinics, administrative systems, and others. The data need to be collated before they can be used for ML purposes.[13] Furthermore, while the use of EMRs may be widespread among primary care physicians, the use of electronic health records (EHRs) in hospitals remains limited, although it is underway. Barriers to data access are compounded by privacy and security issues related to patient data sharing between health authorities.[13] In 2016, the CMA reported that due to the structure of the system, harnessing the utility of big data analytic opportunities may be restricted to primary care for now.[4] The exchange of data between computer systems and health records is vastly complex, and there is currently no simple, economical way to achieve it.[14] Despite these challenges, there are multiple interoperability plans in the works in BC and nationally, offering an optimistic future.[15,16]

Data quality

When training ML algorithms, patient variables (e.g., age, sex, vitals, and lab results) must be available and represented in a standardized format to allow for aggregation of data sets. Furthermore, these predictor variables should be low cost and commonly collected by physicians.[6] However, the variability in recording and formatting data poses a challenge to its use in ML. Patient data are often reported using common clinical terms rather than standardized vocabulary. The nature of medical notes adds another complication as many components lack mathematical characterization.[13] Without a foundation of vast amounts of high-quality data, ML models can perform poorly.

There are also important considerations regarding ethical challenges, such as the mirroring of human biases in decision making. The application of ML algorithms outside the health care setting has highlighted some of these concerns, including a propensity for racial discrimination. Similar biases may also enter medicine. For instance, if algorithms are trained only on data from certain populations, they may be prone to conclude false generalizations to other groups.[6,17,18] Poor representation of populations has been a long-standing issue in the academic community. For example, the Framingham Heart Study poorly predicts the risk of cardiovascular events in non-white populations due to sampling bias.[19] Thus, it is paramount to establish measures to minimize these biases from affecting ML models through early critical evaluation of methodological practices, preventing exacerbation of the disparities that currently plague the medical community. Physicians must advocate for their patients by being at the forefront to ensure that core ethical principles are preserved in the development of ML tools.

To augment this process, one suggestion is that data warehouses be built before the data are used. This way, the data can be reused by multiple researchers or developers for a variety of purposes and the collecting and cleaning process does not need to be repeated.[13] In addition, having physicians equipped with skills to critically evaluate, develop, and deploy such models will help inform appropriate use.

The future physician’s role

As the role of ML in medicine expands, physicians practising clinical medicine will need to evolve.[17] With the help of ML, physicians will be able to leverage greater amounts of information from the biological, psychological, and social aspects of patients’ lives to further augment personalized patient care. To make this possible, continuous, detailed, and affordable monitoring of patients will contribute to big data sets.[3] However, it is of the utmost importance that physicians have a baseline understanding of the tools they are using. They will have to become familiar with the legal implications of these new tools, as they protect the privacy and security of their patients.

Many argue that medical education must begin to incorporate some of these topics. Despite medical curriculums being time-constrained, it may be necessary to alter teaching strategies and incorporate concepts that are going to revolutionize the field. It may be unrealistic for medical students to become developers or data scientists, but we can begin to train future physicians to become data literate. Students should understand the concepts within ML and AI to critically appraise how data were aggregated and analyzed, and ensure they are appropriately applied.[3] Students should also be competent in navigating the terminology.[6] Furthermore, they should understand what it means to practise in a data-rich environment. One study suggests having a foundation of the four Vs of big data: volume of data being collected, variety of data being collected from different sources, velocity of data generation, and veracity of data (i.e., its quality).[3] In addition, understanding the output of ML algorithms and being able to interpret the prediction will help guide clinical decision making. Lastly, medical students will need to learn how to communicate the information to patients and their families.[20]

The need for this shift in thinking is being recognized. For example, Boston University School of Medicine has an introductory ML course,[21] and Stanford University has an AI-assisted health care course available to their medical students.[22] While curriculums begin to transform, alternate options such as accredited CME courses can be provided to begin educating current medical professionals.


As we experience the transformation of the medical landscape through the integration of AI, many new possibilities arise. Physicians will play an important role in ensuring these new tools will maximize benefits to all health care stakeholders. However, we anticipate challenges that require system-level preparation. Data literacy will enable physicians to evaluate the role of these tools, optimize their use in patient care, and help to mitigate any pitfalls that may arise. This can potentially open the way for active participation in the research and development of AI-centred medical tools. This is an exciting revolution: we will be able to draw from the insight gathered from the culmination of billions of patients and their outcomes to inform the care of future patients.

Competing interests

None declared.

This article has been peer reviewed.


1.    British Columbia Ministry of Health. Setting priorities for the BC health system. Accessed 12 August 2019.

2.    British Columbia Ministry of Health. Enabling effective, quality population and patient-centred care: A provincial strategy for health information management and technology. Accessed 12 August 2019.

3.    Wartman SA, Combs CD. Medical education must move from the information age to the age of artificial intelligence. Acad Med 2018;93:1107-1109.

4.    Canadian Medical Association. CMA policy. Guiding principles for the optimal use of data analytics by physicians at the point of care. 2016. Accessed 12 August 2019.

5.    Norgeot B, Glicksberg BS, Butte AJ. A call for deep-learning healthcare. Nat Med 2019;25:14-15.

6.    Waljee AK, Higgins PD. Machine learning in medicine: A primer for physicians. Am J Gastroenterol 2010;105:1224-1226.

7.    Sagner M, McNeil A, Puska P, et al. The P4 health spectrum—A predictive, preventive, personalized and participatory continuum for promoting healthspan. Prog Cardiovasc Dis 2017;59:506-521.

8.    Slodkowska EA, Ross JS. MammaPrint 70-gene signature: Another milestone in personalized medical care for breast cancer patients. Expert Rev Mol Diagn 2009;9:417-422.

9.    Rodríguez-Ruiz A, Krupinski E, Mordang J-J, et al. Detection of breast cancer with mammography: Effect of an artificial intelligence support system. Radiology 2019;290:305-314.

10.   Somashekhar SP, Sepúlveda M-J, Puglielli S, et al. Watson for Oncology and breast cancer treatment recommendations: Agreement with an expert multidisciplinary tumor board. Ann Oncol 2018;29:418-423.

11.   Mason J, Morrison A, Visintini S. An overview of clinical applications of artificial intelligence. CADTH issues in emerging health technologies 2018;174.

12.    Jiang F, Jiang Y, Zhi H, et al. Artificial intelligence in healthcare: Past, present and future. Stroke Vasc Neurol 2017;2:230-243.

13.    Koh HC, Tan G. Data mining applications in healthcare. J Healthc Inf Manag 2005;19:64-72.

14.    Institute of Medicine (US) Committee on Data Standards for Patient Safety. Aspden P, Corrigan JM, Wolcott J, et al, editors. Patient safety: Achieving a new standard for care. Washington: National Academies Press (US); 2004.

15.    Doctors Technology Office. Health technology guide: EMR data portability. 2018. Accessed 15 August 2019.

16.    Doctors Technology Office. Health Technology Guide: Care Plan Technology Approaches [Doctors of BC]. British Columbia; 2018. Accessed 15 August 2019.

17.    Obermeyer Z, Emanuel EJ. Predicting the future—big data, machine learning, and clinical medicine. N Engl J Med 2016;375:1216-1219.

18.    Char DS, Shah NH, Magnus D. Implementing machine learning in health care—addressing ethical challenges. N Engl J Med 2018;378:981-983.

19.    Gijsberts CM, Groenewegen KA, Hoefer IE, et al. Race/ethnic differences in the associations of the Framingham risk factors with carotid IMT and cardiovascular events. PLoS One 2015;10:e0132321.

20.    Wartman SA, Combs CD. Reimagining medical education in the age of AI. AMA J Ethics 2019;21:E146-152.

21.    ScienceDaily. Educating the next generation of medical professionals with machine learning is essential. 27 September 2018. Accessed 15 August 2019.

22.    Stanford University. MED 277 / CS 337: AI-assisted health care. Accessed 15 August 2019.

Ms Mangalji is a third-year UBC medical student who enjoys exploring how the current practice of medicine could be enhanced, specifically the technologies entering medical practice in rural and remote areas to allow for improved access to quality health care. She hopes this article will spark discussion about how a practitioner’s scope could include data-literacy. Mr Karthikeyan is a third-year UBC medical student interested in the intersection of technology and medicine. During the foundational years of his medical studies, he sought out opportunities to engage in machine learning research. He is excited about the prospective roles that machine learning and artificial intelligence could play in augmenting physicians’ practices. He hopes to contribute to efforts to foster a community of data-literate physicians equipped to leverage technology to improve patient outcomes and care.

Azzra Mangalji, BSc, Vishwathsen Karthikeyan, BHSc. Preparing for the future of medicine: Considering the need for data-literate physicians. BCMJ, Vol. 61, No. 8, October, 2019, Page(s) 325-327 - MDs To Be.

Above is the information needed to cite this article in your paper or presentation. The International Committee of Medical Journal Editors (ICMJE) recommends the following citation style, which is the now nearly universally accepted citation style for scientific papers:
Halpern SD, Ubel PA, Caplan AL, Marion DW, Palmer AM, Schiding JK, et al. Solid-organ transplantation in HIV-infected patients. N Engl J Med. 2002;347:284-7.

About the ICMJE and citation styles

The ICMJE is small group of editors of general medical journals who first met informally in Vancouver, British Columbia, in 1978 to establish guidelines for the format of manuscripts submitted to their journals. The group became known as the Vancouver Group. Its requirements for manuscripts, including formats for bibliographic references developed by the U.S. National Library of Medicine (NLM), were first published in 1979. The Vancouver Group expanded and evolved into the International Committee of Medical Journal Editors (ICMJE), which meets annually. The ICMJE created the Recommendations for the Conduct, Reporting, Editing, and Publication of Scholarly Work in Medical Journals to help authors and editors create and distribute accurate, clear, easily accessible reports of biomedical studies.

An alternate version of ICMJE style is to additionally list the month an issue number, but since most journals use continuous pagination, the shorter form provides sufficient information to locate the reference. The NLM now lists all authors.

BCMJ standard citation style is a slight modification of the ICMJE/NLM style, as follows:

  • Only the first three authors are listed, followed by "et al."
  • There is no period after the journal name.
  • Page numbers are not abbreviated.

For more information on the ICMJE Recommendations for the Conduct, Reporting, Editing, and Publication of Scholarly Work in Medical Journals, visit

BCMJ Guidelines for Authors

Leave a Reply