Skip to main content

Artificial intelligence in orthopedic surgery: evolution, current state and future directions


Technological advances continue to evolve at a breath-taking pace. Computer-navigation, robot-assistance and three-dimensional digital planning have become commonplace in many parts of the world. With near exponential advances in computer processing capacity, and the advent, progressive understanding and refinement of software algorithms, medicine and orthopaedic surgery have begun to delve into artificial intelligence (AI) systems. While for some, such applications still seem in the realm of science fiction, these technologies are already in selective clinical use and are likely to soon see wider uptake. The purpose of this structured review was to provide an understandable summary to non-academic orthopaedic surgeons, exploring key definitions and basic development principles of AI technology as it currently stands. To ensure content validity and representativeness, a structured, systematic review was performed following the accepted PRISMA principles. The paper concludes with a forward-look into heralded and potential applications of AI technology in orthopedic surgery.

While not intended to be a detailed technical description of the complex processing that underpins AI applications, this work will take a small step forward in demystifying some of the commonly-held misconceptions regarding AI and its potential benefits to patients and surgeons. With evidence-supported broader awareness, we aim to foster an open-mindedness among clinicians toward such technologies in the future.


The incorporation of technology into everyday medical practice is accelerating at an incredible rate — in few areas more so than in the domain of orthopedic surgery. Real-time navigated, computer-guided [1] and robot-assisted [1, 2] intraoperative input has become commonplace in many regions. Two-dimensional imaging is rapidly being replaced by virtual three-dimensional (3D) displays [3,4,5], and interactive digital, semi-automated or fully-automated preoperative planning and templating are also widely available in many developed settings [3].

While the logic-driven computing processes that underpin such technologies are impressive to say the least, most output functions are the result of ‘human-defined’ iterative pathways with parameters set in keeping with progressive logic principles. The integration of artificial intelligence (AI) machine learning (ML) algorithms into driving decision-making pathways represents an evolution beyond the basic limitations of conscious human learning considerations and accepted logistic analytical regression [6, 7].

Once a realm of science fiction, AI applications have rapidly integrated into accepted ‘everyday’ life around us. Many are surprised to learn for how long ‘everyday’ examples of AI intrusion into our worlds has been commonplace. The AI-driven ‘Google Search’ function came into mainstream use nearly 25 years ago (1998) [8], predicting search patterns and ‘pre-empting’ active searching. In a more contemporary sense, ‘Google Translate’ [9], Facebook’s ‘Phototagger’ (2015) [8], Uber’s rideshare demand prediction, and Apple’s well-known voice-responsive pocket assistant ‘Siri’ [9] are all examples of mature AI algorithms with wide public application.

While orthopedic surgeons have long stood proudly behind the craftmanship of their clinical trade, we can no longer remain ignorant to the global forward creep of technology. As we seek to further improve the quality and outcomes of the services we provide, computer-assisted and AI applications show increasingly apparent opportunities for their integration into everyday practice [1]. The exponentially-expanding volume of information we collect pre-, intra- and post-surgery, and the coupled unprecedented data aggregation rate [10, 11], lends itself almost perfectly to offloading to computer-driven applications [1, 10, 12, 13]. While patient-generated health data (PGHD) [14] are becoming commonplace, the sheer enormity of captured data points for a single patient [15,16,17] yield almost more information than can perceivably be comprehended and managed by the human processing capacity. Even ‘off-the-shelf’ wearable sensors [15, 16] may capture several million discrete data points [14] for small cohorts, or single patients tracked for extended time periods. This mass of stored digital information is collectively known as ‘big data’ [8, 10, 12, 18, 19]. Computers (and computer applications) are ideally suited to objectively receive, categorize and interpret such large amounts of patient- and care-related materials. However, before orthopedic surgeons can confidently relinquish responsibility of data management to computers and overcome the innate biases towards such a process, one must have at least a basic understanding of the potentially advantageous role of computers in life, healthcare and surgery, including how such technologies have evolved, what they are reasonably suited to do, and how we quality-assure their contributions moving forward. The aim of this review is to offer, to the practising clinical orthopedic surgeons, definitions and fundamental understanding of AI and its applications, with the goal of demystifying some of the elements and misconceptions that harbour an oppressive reluctance to engage with technology in our working spheres. It is certainly not the intent to provide an all-encompassing and highly-detailed summary, rather to ‘introduce’ the key topic elements to the naïve reader.


To ensure a relevant, accurate and representative synopsis of the current state of understanding of AI in orthopedics, a formal, structured and systematic search and retrieval of publications was performed according to the accepted Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. The search results are depicted in Fig. 1. Three databases: (i) Cochrane, (ii) EMBASE and (iii) Medline, were searched from inception until 31 August 2021. Search results were limited, in the first instance, to articles available in the English language with available abstracts.

Fig. 1
figure 1

PRISMA search summary. * Tues Aug 31 03:05:102021 Search: ((artificial intelligence OR AI) AND (orthopaedics OR orthopedics) AND (arthroplasty OR joint replacement) AND (TKR OR THR OR THA OR TKA) AND (English\[Language]))

Initially, 353 articles were identified during preliminary database searching. After exclusion of duplicates, articles which did not match the search intent (i.e. papers not specifically exploring AI applications) and articles not available in full text form, 65 full-text papers were manually reviewed. At the end of the review process, 52 articles were deemed appropriate for inclusion. As a relatively new topic in the field of medicine, it was found that there existed a lack of quantitative research within the domain AI in surgery, thus preventing formal ‘meta-analysis’ per se. With the preserved intent of providing a contemporary synopsis of the topic, a structured review of the identified literature was performed in keeping with meta-synthesis principles.

Artificial intelligence

The term ‘artificial intelligence’ was coined by John McCarthy in 1956 [8, 11], originally as a theoretical proposition of a future stage whereby computers would ‘learn’ to perform automated tasks through algorithmic pattern recognition with limited (if any) direct human input [8]. In a more practical sense, AI has represented a state of ‘cognitive unloading’ whereby a computer can be tasked with the activity of scrutinizing large volumes of data in time frames uncomprehendable by the conscious human mind. These computers follow programmed algorithms (sets of computational instructions) to perform highly specific tasks. This may include ‘categorizing’, ‘identifying’ or ‘linking’ discrete variables, and specific factors may be selectively weighted more or less favorably [13] in decision-making analysis. The practical value of such task delegation becomes the ability to sift through large volumes of information in rapid time [20,21,22]. Depending on the assigned ‘task’, the computer(s) may identify trends or ‘patterns’ [8] in the data set which can be used to demonstrate attribute association. In its simplest form, the computer is offered a ‘training’ dataset [3, 23] (often a subset of a larger volume of known information). In clinical orthopedic practice, training datasets will routinely contain information pertaining to several thousand individual patients [21, 24]. With highly-specified coding (instructions), the computer ‘learns’ [25] to recognize specific features. The accuracy with which the computer can do this is often then compared to a known ‘gold standard’ (historically a human-defined standard). Large volumes of data can be fed into the AI system which ultimately seeks to ‘pattern recognize’ with great speed and reproducibility, usually linking to a pre-determined outcome or series of outcomes. The accuracy of the system can be progressively refined as the algorithm is adjusted to facilitate progressively more accurate assessment of the dataset. The new outcome is again measured against a gold standard, the accuracy determined, and error improvement measures feed back into the index algorithm. In this sense, the AI system is ‘taught’ to be more precise/accurate. This process has been likened to the process of progressive human learning whereby sequential exposure and re-visiting reinforces understanding.

The AI training cycle can be repeated (usually many times) with sequential algorithm amendments until an acceptable level of precision is achieved. Each full learning cycle (referred to as an ‘epoch’) allows progressive refinement of the discriminatory AI algorithm, improving the system accuracy. Depending on the size and complexity of the training dataset, and the desired final system ‘accuracy’ requirements, many conventional AI systems will be looped through between 10 and 1000 epochs [7, 23, 26, 27]. This process of ‘teaching’ the computer to perform a semi-automated task is referred to as ‘machine learning’ [10, 12, 13, 28,29,30]. While for certain tasks a highly accurate and reproducible outcome can be achieved, this process is usually time- and human-input-intensive to achieve initial establishment. Once a ‘final’ algorithm has been settled upon (with precision deemed acceptable to the task at hand), the AI system can be applied independently to future (previously unseen) datasets and, through predictive modelling [31], be used to determine a likely outcome based on complex multivariate analysis.

As a practical example, AI algorithms have been applied to arthroplasty component recognition from plain radiographs [32]. The algorithm is given information regarding key imaging features that a ‘human’ interpreter might use to discern one type or brand of implant from another. This might include information regarding length/size, proximal body versus stem segment proportions, changes in curvature, neck-shaft angles, the presence or absence of a collar etc. A training set of images is presented to the system (i.e. a series of plain radiographs of implants for which the actual brand/model is known). The AI system then assesses the images trying to match radiographic features to known (inputted) implant parameters. After this, the process of image assessment [26] has been completed, the ‘accuracy’ of the system is established versus the known (correct) answers. Where the system has been inaccurate, additional detail/information can be fed back into the algorithm (i.e. manually entered) and the training epoch repeated. This cyclic process can be repeated – with sequential addition of information to the algorithm – until a level of accuracy deemed ‘acceptable’ is achieved. The AI system is then ready for ‘real life’ application.

With progressive computational power and algorithm refinement capacity, the AI system can be ‘taught’ to self-evaluate its own performance and amend its own internal algorithmic codes to improve performance [8]. The index AI algorithm refines with growing exposure to training datasets, sequentially improving iteration accuracy and ultimately maximizing predictive power [13]. While this seems simply a more refined version of the previously described ML processes, it requires an entirely different iterative programming basis whereby the program has capability and autonomy to ‘self-write’ its own coding instructions, a step towards true ‘automation’ [33, 34]. In doing so, it eliminates the need for direct human input/involvement in the algorithm refinement process and can greatly reduce the timeframe required to achieve a viable/usable system. This version of AI is referred to as ‘deep learning (DL)’ [10, 20, 23, 25, 32]. The system starts with a set of pre-determined key outcomes and known, linked, associative variables. It progressively re-refines its cluster association capacity with each new epoch, improving accuracy. The algorithmic functionality of modern DL neural networks [3] allows the artificial establishment of multi-layered ‘evolutionary plexuses’ that have been conceptually likened to the human neurons [11]. Most DL systems consist of some form of artificial neural network (ANN) [22, 25, 28], a series of iterative processing steps between an ‘input’ layer (for example where the data being considered are entered) and a final ‘output’ layer. A stylized depiction is shown in Fig. 2. Each initial input variable or data point can be linked to one or more (or all) evaluative steps in the next layer. In turn, each discrete data point in the second layer can be linked to one or more (or all) evaluative steps in the next layer, and so on. These layers between the initial input and final output layers are referred to as ‘hidden layers’. The larger the number of input points, and the larger the number of neural ‘hidden layers’, the greater the complexity of analysis (and the greater the demand upon computer processing power). With increasing sophistication of inter-relation, data points embedded within individual layers can be influenced in a selectively disproportionate manner by adjacent data points (in a sense, the influence of these points can be weighted more heavily in influencing the final output) — such complex systems are commonly referred to as deep convolutional neural networks [CNNs] [26, 35,36,37].

Fig. 2
figure 2

A stylized deep neural network depiction. Source:

There are many potential advantages to such true DL ‘artificial intelligence’ (beyond the value of timeliness), including being free from the otherwise incumbent limitations of human inconsistency, fatigue [8], cognitive judgement error [8], selection bias and poor inter-user correlation — all of which plague manual (human) assessment of large-volume multi-focal datasets. Dealing with such enormous sets of information, of almost infinite complexity [9], AI systems have the potential to identify variable associations that have not previously been considered [15] in an evidence-based ‘manual’ manner — in a sense, amplifying human cognition [9]. Many argue that this is where the ‘true value’ of AI in healthcare may lie in the future (inter-related risk factor association to desired or known clinical endpoints is an obvious exemplar). As appealing as this may sound, current AI systems are equally not without their limitations and inherent problems. Beyond the need for huge processing capacity (and associated expense), AI systems are currently only as good as the human processes that establish them. The ‘garbage in, garbage out’ mantra holds true here. If the index AI algorithms are set up with intrinsic errors or fundamental mis-assumptions, these will be propagated through the process and result in erroneous outputs. The further the system self-refines, the greater the sequential error can become. While the system’s primary objective may be set to achieve outcome ‘accuracy’, it may internally achieve this by refinement changes that violate basic human logic principles to best ‘achieve’ its desired task.

Essential requirements for AI

In essence, there are three basic prerequisites for the development of an AI system. Firstly, a large volume of data pertinent to the topic of interest must exist and be able to be inputted in some predictable and consistent manner to the system itself. Data entry and/or extraction are often underappreciated key elements of the AI process. Imprecise or unrepresentative data provision will ultimately lead to erroneous determinants and outputs. Secondly, substantial computing and data processing power (capacity) is needed [8, 12, 19]. In many iterations, this has been the rate-limiting step to the forward advancement of AI applications. As discussed previously, as the number of data input sources and the number of ‘hidden layers’ increase – especially within a deep CNN – the demand on computer processing capacity increases almost exponentially. Computer ‘processing’ capacity is commonly measured in units known as ‘Floating Operations Per Second (FLOPS)’. In comparing the most powerful computer processors from 1955 to 2015, there has been a one-trillion-fold increase in FLOPS [19]. As a more practical example, the computer processing unit (CPU) of the entire Apollo 11 moon landing guidance computer in 1969 was estimated to be approximately 2 MHz; by comparison, a pocket-held iPhone 12 personal mobile phone has a CPU speed on 3.1 GHz (i.e. 3100 MHz) – that being a more than 1500 times greater processing power.

The third requirement of an AI system is an underlying algorithm. Given the wide diversity of AI applications (well beyond the closed sphere of medicine), different AI algorithms have evolved to best manage specific problems [38]. While a detailed synopsis falls beyond the scope of this paper, a basic summary of common AI algorithms is shown in Table 1. While many (if not all) of these algorithmic subtypes have conceivable application in medicine and orthopedic surgery specifically, ‘Association Rule Learning Algorithms’, ‘Deep Learning Algorithms’ and ‘Artificial Neural Network Algorithms’ are perhaps the most commonly encountered ones to date.

Table 1 Common AI algorithm types

Potential advantages of AI in orthopedics

The are many potential advantages to the incorporation of AI into everyday orthopedic practice. There are conceivably diagnostic, decision-making and technique execution, and administrative considerations. Prior to surgery, AI applications have already been employed to either improve the accuracy of, or reduce the time associated with critical diagnostic steps. This may involve recognition and classification of pathology, or the correct determination of in situ implant types/models. The excellent discriminative capacity [9] of digital image-based AI applications lends itself to such uses. This has clear and immediate implications for revision surgery whereby accurate identification of the current patient implant (often reflecting models no longer in routine use) is critical for salvage option planning and equipment ordering. Several high-quality papers have already been published in this domain reporting AI system precision in the preoperative image-based identification on in situ hip and knee replacement constructs. Such papers described ‘near-perfect’ functionality [20, 27] with accuracies as high as 99.6% [23] or better [35] claimed in the correct identification of limited series of included implant types. Additionally, all of the available clinical applications in automated implant recognition cited significant time-saving during this process [20,21,22]. Murphy et al. (2021) reported an average implant recognition time of just 0.96 s using an out-dated off-the-shelf iPhone 6 after photographing the relevant patient anteroposterior (AP) pelvic radiographs [22]. This incredible time-saving is already being touted as an enormous potential cost-saver through reducing the time otherwise spent by clinicians to perform such similar ‘manual identification’ tasks [20]. There are many more such examples of ‘real life’ clinical applications of AI technologies which are explored in the next section.

Common decision-making activities can also be supported by, or offloaded to, AI applications. In the setting of a well-established and informed AI system, the ability to associate and consider a myriad of linked patient variables provides, in many circumstances, superior risk prediction [39], which can be used prospectively to achieve evidence-supported risk mitigation [6]. Similarly, by assessment or incorporation of postoperative patient-reported outcome measure (PROM) data, AI algorithms can be trained to pattern-recognize key elements (singularly or multi-factorially) which contribute to optimized clinician- and/or patient-defined outcomes. Used appropriately, this information can be looped back into patient screening and decision-making processes and may be helpful in preoperative expectation management [28] and allowing provision of objective feedback regarding the quality of delivered care [33]. Both considerations may be used to ultimately improve patient outcomes [24, 29, 35]. Inextricably linked to this prospective risk-determination capacity is the ability to subgroup patients based on perceived risk profile, allowing episode-of-care cost prediction [18, 25, 28]. Such analyses have already been applied to arthroplasty cohorts of group sizes approaching 150,000 patients [undergoing total knee arthroplasty (TKA)] [18].

From an information management perspective, AI algorithms have already been shown to hold great value. The two key elements of improved data management accuracy and time saving (largely through amelioration of the need for a human time commitment) are central here. Both have undeniable potential for considerable healthcare cost savings [20]. As patient medical records transition more and more uniformly to electronic based medical records (EMRs), the burden of near-oppressive data gathering volumes plagues many developed health networks. With so much volumetric data available, how does a single clinician sort through and retrieve key elements critical for point-of-care decision making? Published forays into EMR data synthesis and extraction have shown very positive potential [21]. Such applications – after only modest epoch training cycles — have been shown to provide higher key information extraction accuracy than ‘manual’ record searching, but with far greater speed [33]. A recently published paper by van de Meulebroucke and colleagues (2019) showed an impressive accuracy of over a 95% in EMR data extraction using AI in a cohort of nearing 550 patients in a non-academic healthcare setting [33].

Clinical applications

Despite being still considered by many a ‘novel’ field, AI in orthopedic surgery is being used with ever-increasing frequency. Large volumes of patient data, ever-increasing patient expectations of positive post-surgical outcomes, and a profession-driven push to improve the quality and precision of care we offer have opened many avenues for AI use. Orthopedic applications include image recognition (diagnostics/implant identification [32]), risk prediction, cost-outcome determinations, clinical decision making seem popular early targets of AI technologies. Applications in primary TKA [7], primary total hip arthroplasty (THA) and resurfacing [23], and primary total shoulder arthroplasty (TSA) [18, 28] have all been reported with positive value.

In a preoperative sense, AI has been used and reported in areas such as: length-of-stay [18, 25, 28] and episode-of-care cost prediction, each with reportedly ‘excellent’ validity [18]. Grading of osteoarthritis from plain radiographs was shown in early versions to be as accurate as fellowship-trained arthroplasty surgeons in making the same determination, but significantly quicker [37]. As to pathologic feature recognition [8, 27] (including fractures), its precision has been further improved by recursive feature elimination [40] whereby training datasets can be refined to allow more ‘targeted’ feature-of-interest recognition (thus reducing image feature ‘noise’ that may otherwise degrade the accuracy of human interpretation), 3D templating and operative planning [3] having achieved accuracy of 90% or greater compared to just 56.7% for conventional acetate-based manual planning and the patient-specific influence of pelvic sagittal inclination [34] on THA construct stability being able to be determined to inform intraoperative acetabular component placement. Discharge destination prediction [25, 28, 41], the likelihood of prolonged opioid prescription after THA [6], and the likelihood of blood transfusion after TKA [42] have also been successfully reported. Even more user-friendly web-based applications [41, 42] for discharge destination and RBC transfusion likelihood have also been tested in live clinical scenarios.

The automated image processing [27] capacity of AI has also been applied to the diagnosis of periprosthetic joint infection [24] against the Musculoskeletal Infection Society (MSIS) standard; peri-prosthetic fracture classification [21] as per the Vancouver classification, with purported sensitivity of 100 and specificity of 99.8%; and diagnosis of periprosthetic component loosening [36] of both THA and TKA constructs, with an overall accuracy of 88.3%. Postoperatively, the overall prospective determination of the likelihood of future need for TKA revision has also been demonstrated with high utility in a large cohort of 25,104 patients after the primary procedure [7], as ha the value in automated postoperative monitoring and outcome assessment [13], risk prediction [8], including the likelihood of dislocation after primary THA [26]. Ultimately, applications already in research or early clinical use have been shown to predict [40, 43, 44] and/or improve patient satisfaction [45] and overall outcomes [19].

Looking into the future

The principal value of AI appears to hinge around data-driven optimiszed outcomes [12]. The technology has already been shown to have value in supporting [9, 29, 39, 45] or driving clinical decision making [8, 13]. The feedback loop incorporation of PROMs [14, 29] continues to help improve both subjective patient satisfaction and clinical outcomes [19, 45]. Large studies have already been completed after hip and knee replacement surgery exploring the minimum clinically-important differences in multiple standard PROMs [13, 46] to better inform the differentiation between ‘statistical’ and ‘actual’ clinical differences. With increasing refinement of complex discriminative AI algorithms, one could anticipate this to improve in the future. Pre-emptive risk profiling for potential perioperative adverse events or major complications [39] after arthroplasty surgery will also help to inform patient-centric consenting and material risk determination. Such robust preoperative information will undoubtedly be utilized to drive more patient-individualized reimbursement and episode-of-care payment models [8].

As medicine and surgery push onward towards a seemingly inevitable uptake of technology-assisted data management (including potentially AI applications), there presents a unique opportunity for a more universal commitment to ‘standardization’ of prospective patient data capture and storage. This will facilitate future seamless integration of EMRs with PGHD systems [14] — and subsequent data extraction — to allow optimized post-hoc data management and analysis. The current inconsistencies and incompatibilities in fundamental computing platform and operating systems stand as a key barrier to potentially more meaningful uptake. While ‘open access’ platforms would facilitate greater data usability, in their current form, such systems lack the intrinsic patient privacy and confidentiality requirements necessary to support wider healthcare use. The opportunity for further work in this realm clearly exists such that protection of patients’ interests and rights keep pace with the desire for technological advancement.


Artificial intelligence applications are already in widespread use in medicine and the sub-fields of orthopedics and have already shown great potential to transform care [8, 10]. The ability of refined algorithms to draw upon digital information readily stored in large database and registry [7, 18, 25, 28, 30] repositories further improves the value, accuracy and practical relevance of the outcomes reported. Particularly self-determining systems, such as the newer deep CNNs [26, 35,36,37], have increasing capacity for the management of complex datasets [9] and may allow the identification of variable associations not previously conceived through conventional human-driven analytics. Certainly others involved in the development of such technologies are enthusiastic about the future and those immersed in such current clinical applications espouse AI as holding very real potential to expand the horizons of orthopedics [12, 26].

Much still needs to be done, however, before AI applications can be generally accepted into mainstream care [11]. In the current setting, the computing hardware requirements alone carry great intrinsic expense and within global healthcare systems under ever-increasing cost pressure the burden of utilitarianistic parsimony [29] remains an often rate-limiting road block. Many novel ‘research’ applications still require verification and validation in the wider clinical setting [3] or at least result duplication remote from the index institution(s). Many impartial observers concede only a role for AI applications in augmenting clinical decision-making, not yet being in a position to replace it [36] — in many areas, this likely holds true.

It is important to note that, with some applications, preliminary research has suggested no value over conventional approaches. For example, the recent 2020 publication by Pau and colleagues [30] suggested that AI self-learning algorithms did not outperform simple logistic regression in predicting postoperative walking limitation after joint replacement surgery. Similarly, a large registry-based AI analysis in Denmark failed to show a meaningful benefit in the application of ML algorithms to the prediction of future early revision need following primary TKA [7], despite using four different, purpose-designed AI models. The authorship group did concede, however, that their iterative logistic regression approaches may have lacked the necessary embedded comorbidity data required to allow sufficient predictive value and demonstrate a ‘true’ benefit.


As technology advances further in the world around us, the role that AI applications play in orthopedics is already seen by many as an inevitable future step. Much investment and early work have already explored the optimization of early utilization of such approaches, across a range of clinical and para-clinical domains. Like any new technology, effort must be expended to ensure AI applications touted for clinical use have shown evidence-based rationales for adoption with non-inferiority (but ideally improved) outcomes against the conventional standards. It is important that the science underpinning such advancements is not outpaced by the hype. While it is likely that computer- and AI-based programs will add value to areas where human cognition and capacity stand as rate-limiting factors, the expense and effort required to establish such systems must be positively weighed against the perceivable benefit. Current generation AI algorithms, particularly deep convoluted CNNs, lend themselves to image feature recognition and multi-variate risk analysis/outcome prediction and these are the current areas of greatest research interest. Volumetric data management and prospective episode-of-care/payment model stratification are also being actively explored. Being such a novel and unprecedented frontier, AI in medicine presently lacks widely-accepted governance and regulatory provisions which will need to evolve at a similar rate to ensure safe and optimal utilization with respect for individual patient data and circumstances.

Availability of data and materials

Not applicable.



Three dimensional


Artificial intelligence


Artificial neural network




Convolutional neural network


Computer processing unit


Deep learning


Electronic medical record


Floating Operations Per Second


Machine learning


Musculoskeletal Infection Society


Patient generated health data


Preferred reporting items for systematic reviews and meta-analyses


Patient-reported outcomes measures


Total knee arthroplasty


Total hip arthroplasty


Total shoulder arthroplasty


  1. Bini SA. Artificial intelligence, machine learning, deep learning, and cognitive computing: what do these terms mean and how will they impact health care? J Arthroplast. 2018 Aug;33(8):2358–61.

    Google Scholar 

  2. Haeberle HS, Helm JM, Navarro SM, Karnuta JM, Schaffer JL, Callaghan JJ, et al. Artificial intelligence and machine learning in lower extremity arthroplasty: a review. J Arthroplast. 2019 Oct;34(10):2201–3.

    Google Scholar 

  3. Helm JM, Swiergosz AM, Haeberle HS, Karnuta JM, Schaffer JL, Krebs VE, et al. Machine Learning and Artificial Intelligence: Definitions, Applications, and Future Directions. Curr Rev Musculoskelet Med. 2020;13(1):69–76.

    PubMed  PubMed Central  Google Scholar 

  4. Myers TG, Ramkumar PN, Ricciardi BF, Urish KL, Kipper J, Ketonis C. Artificial intelligence and Orthopaedics: an introduction for clinicians. J Bone Joint Surg Am. 2020;102(9):830–40.

    PubMed  Google Scholar 

  5. Beyaz S. A brief history of artificial intelligence and robotic surgery in orthopedics & traumatology and future expectations. Jt Dis Relat Surg. 2020;31(3):653–5.

    PubMed  PubMed Central  Google Scholar 

  6. Magan AA, Kayani B, Chang JS, Roussot M, Moriarty P, Haddad FS. Artificial intelligence and surgical innovation: lower limb arthroplasty. Br J Hosp Med (Lond). 2020;81(10):1–7.

    CAS  Google Scholar 

  7. Ramkumar PN, Haeberle HS, Bloomfield MR, Schaffer JL, Kamath AF, Patterson BM, et al. Artificial intelligence and arthroplasty at a single institution: real-world applications of machine learning to big data, value-based care, Mobile health, and remote patient monitoring. J Arthroplast. 2019;34(10):2204–9.

    Google Scholar 

  8. Fontana MA, Lyman S, Sarker GK, Padgett DE, MacLean CH. Can machine learning algorithms predict which patients will achieve minimally clinically important differences from Total joint arthroplasty? Clin Orthop Relat Res. 2019;477(6):1267–79.

    PubMed  PubMed Central  Google Scholar 

  9. Karnuta JM, Luu BC, Roth AL, Haeberle HS, Chen AF, Iorio R, et al. Artificial intelligence to identify arthroplasty implants from radiographs of the knee. J Arthroplast. 2021;36(3):935–40.

    Google Scholar 

  10. Wu D, Liu X, Zhang Y, Chen J, Tang P, Chai W. Research and application of artificial intelligence based three-dimensional preoperative planning system for total hip arthroplasty. Zhongguo Xiu Fu Chong Jian Wai Ke Za Zhi. 2020;34(9):1077–84.

    PubMed  Google Scholar 

  11. Karnuta JM, Haeberle HS, Luu BC, Roth AL, Molloy RM, Nystrom LM, et al. Artificial intelligence to identify arthroplasty implants from radiographs of the hip. Arthroplasty. 2020;S0883-5403(20):31206–7.

    Google Scholar 

  12. Karnuta JM, Churchill JL, Haeberle HS, Nwachukwu BU, Taylor SA, Ricchetti ET, et al. The value of artificial neural networks for predicting length of stay, discharge disposition, and inpatient costs after anatomic and reverse shoulder arthroplasty. J Shoulder Elb Surg. 2020;29(11):2385–94.

    Google Scholar 

  13. Ramkumar PN, Karnuta JM, Navarro SM, Haeberle HS, Scuderi GR, Mont MA, et al. Deep learning preoperatively predicts value metrics for primary Total knee arthroplasty: development and validation of an artificial neural network model. Arthroplasty. 2019;34(10):2220–2227.e1.

    Google Scholar 

  14. Borjali A, Chen AF, Muratoglu OK, Morid MA, Varadarajan KM. Detecting total hip replacement prosthesis design on plain radiographs using deep convolutional neural network. J Orthop Res. 2020;38(7):1465–71.

    PubMed  Google Scholar 

  15. Van de Meulebroucke C, Beckers J, Corten K. What can we expect following anterior Total hip arthroplasty on a regular operating table? A validation study of an artificial intelligence algorithm to monitor adverse events in a high-volume. Nonacademic Setting J Arthroplasty. 2019 Oct;34(10):2260–6.

    PubMed  Google Scholar 

  16. Fu S, Wyles CC, Osmon DR, Carvour ML, Sagheb E, Ramazanian T, et al. Automated detection of Periprosthetic joint infections and data elements using natural language processing. J Arthroplast. 2021;36(2):688–92.

    Google Scholar 

  17. Harris AHS, Kuo AC, Bowe TR, Manfredi L, Lalani NF, Giori NJ. Can machine learning methods produce accurate and easy-to-use preoperative prediction models of one-year improvements in pain and functioning after knee arthroplasty? J Arthroplast. 2021;36(1):112–117.e6.

    Google Scholar 

  18. Farooq H, Deckard ER, Ziemba-Davis M, Madsen A, Meneghini RM. Predictors of patient satisfaction following primary Total knee arthroplasty: results from a traditional statistical model and a machine learning algorithm. J Arthroplast. 2020;35(11):3123–30.

    Google Scholar 

  19. Leopold SS. Editor's spotlight/take 5: can machine learning algorithms predict which patients will achieve minimally clinically important differences from Total joint arthroplasty? Clin Orthop Relat Res. 2019;477(6):1262–6.

    PubMed  PubMed Central  Google Scholar 

  20. Bini SA, Shah RF, Bendich I, Patterson JT, Hwang KM, Zaid MB. Machine learning algorithms can use wearable sensor data to accurately predict six-week patient-reported outcome scores following joint replacement in a prospective trial. J Arthroplast. 2019;34(10):2242–7.

    Google Scholar 

  21. Kunze KN, Polce EM, Sadauskas AJ, Levine BR. Development of machine learning algorithms to predict patient dissatisfaction after primary Total knee arthroplasty. J Arthroplast. 2020;35(11):3117–22.

    Google Scholar 

  22. Lu Y, Khazi ZM, Agarwalla A, Forsythe B, Taunton MJ. Development of a machine learning algorithm to predict nonroutine discharge following Unicompartmental knee arthroplasty. J Arthroplast. 2021;36(5):1568–76.

    Google Scholar 

  23. Shah RF, Bini SA, Martinez AM, Pedoia V, Vail TP. Incremental inputs improve the automated detection of implant loosening using machine-learning algorithms. Bone Joint J. 2020;102-B(6_Supple_A):101–6.

    PubMed  Google Scholar 

  24. Tibbo ME, Wyles CC, Fu S, Sohn S, Lewallen DG, Berry DJ, et al. Use of natural language processing tools to identify and classify Periprosthetic femur fractures. Arthroplasty. 2019;34(10):2216–9.

    Google Scholar 

  25. Pua YH, Kang H, Thumboo J, Clark RA, Chew ES, Poon CL, et al. Machine learning methods are comparable to logistic regression techniques in predicting severe walking limitation following total knee arthroplasty. Knee Surg Sports Traumatol Arthrosc. 2020;28(10):3207–16.

    PubMed  Google Scholar 

  26. Navarro SM, Wang EY, Haeberle HS, Mont MA, Krebs VE, Patterson BM, et al. Machine learning and primary Total knee arthroplasty: patient forecasting for a patient-specific payment model. J Arthroplast. 2018;33(12):3617–23.

    Google Scholar 

  27. Schwartz AJ, Clarke HD, Spangehl MJ, Bingham JS, Etzioni DA, Neville MR. Can a convolutional neural network classify knee osteoarthritis on plain radiographs as accurately as fellowship-trained knee arthroplasty surgeons? J Arthroplast. 2020;35(9):2423–8.

    Google Scholar 

  28. Kunze KN, Karhade AV, Sadauskas AJ, Schwab JH, Levine BR. Development of machine learning algorithms to predict clinically meaningful improvement for the patient-reported health state after Total hip arthroplasty. J Arthroplast. 2020;35(8):2119–23.

    Google Scholar 

  29. Bloomfield RA, Williams HA, Broberg JS, Lanting BA, McIsaac KA, Teeter MG. Machine learning groups patients by early functional improvement likelihood based on wearable sensor instrumented preoperative timed-up-and-go tests. J Arthroplast. 2019;34(10):2267–71.

    Google Scholar 

  30. Shah AA, Devana SK, Lee C, Kianian R, van der Schaar M, SooHoo NF. Development of a novel, potentially universal machine learning algorithm for prediction of complications after Total hip arthroplasty. J Arthroplast. 2021;36(5):1655–1662.e1.

    Google Scholar 

  31. Karhade AV, Schwab JH, Bedair HS. Development of machine learning algorithms for prediction of sustained postoperative opioid prescriptions after Total hip arthroplasty. J Arthroplast. 2019;34(10):2272–2277.e1.

    Google Scholar 

  32. El-Galaly A, Grazal C, Kappel A, Nielsen PT, Jensen SL, Forsberg JA. Can machine-learning algorithms predict early revision TKA in the Danish knee arthroplasty registry? Clin Orthop Relat Res. 2020;478(9):2088–101.

    PubMed  PubMed Central  Google Scholar 

  33. Murphy M, Killen C, Burnham R, Sarvari F, Wu K, Brown N. Artificial intelligence accurately identifies total hip arthroplasty implants: a tool for revision surgery. Hip Int. 2021;8:1120700020987526.

    Google Scholar 

  34. Forsberg JA. CORR insights®: what is the accuracy of three different machine learning techniques to predict clinical outcomes after shoulder arthroplasty? Clin Orthop Relat Res. 2020;478(10):2364–6.

    PubMed  PubMed Central  Google Scholar 

  35. Jayakumar P, Moore MG, Furlough KA, Uhler LM, Andrawis JP, Koenig KM, et al. Comparison of an artificial intelligence-enabled patient decision aid vs educational material on decision quality, shared decision-making, patient experience, and functional outcomes in adults with knee osteoarthritis: a randomized clinical trial. JAMA Netw Open. 2021;4(2):e2037107.

    PubMed  PubMed Central  Google Scholar 

  36. Jacofsky DJ, Allen M. Robotics in arthroplasty: a comprehensive review. J Arthroplast. 2016;31(10):2353–63.

    Google Scholar 

  37. Zhao JX, Su XY, Zhao Z, Xiao RX, Zhang LC, Tang PF. Radiographic assessment of the cup orientation after total hip arthroplasty: a literature review. Ann Transl Med. 2020;8(4):130.

    PubMed  PubMed Central  Google Scholar 

  38. Ramkumar PN, Haeberle HS, Ramanathan D, Cantrell WA, Navarro SM, Mont MA, et al. Remote patient monitoring using Mobile health for Total knee arthroplasty: validation of a wearable and MachineLearning-based surveillance platform. J Arthroplast. 2019;34(10):2253–9.

    Google Scholar 

  39. Yi PH, Wei J, Kim TK, Sair HI, Hui FK, Hager GD, et al. Automated detection & classification of knee arthroplasty using deep learning. Knee. 2020;27(2):535–42.

    PubMed  Google Scholar 

  40. Batailler C, Swan J, Sappey Marinier E, Servien E, Lustig S. New Technologies in Knee Arthroplasty: current concepts. J Clin Med. 2020;10(1):47.

    PubMed Central  Google Scholar 

  41. Infographic MCP. The growth of computer processing power over the last six decades … . Offgrid magazine; 2017.

    Google Scholar 

  42. Kurmis AP. The developing role of knee MRI in musculo-skeletal radiology: the progression to 3-D imaging. The Radiographer. 2001;48(1):21–8.

    Google Scholar 

  43. Góralewicz B. Types of AI Algorithms. Accessed online at: 14:47 31/08/2021.

  44. Rouzrokh P, Ramazanian T, Wyles CC, Philbrick KA, Cai JC, Taunton MJ, et al. Deep learning artificial intelligence model for assessment of hip dislocation risk following primary Total hip arthroplasty from postoperative radiographs. J Arthroplast. 2021;36(6):2197–2203.e3.

    Google Scholar 

  45. Jo C, Ko S, Shin WC, Han HS, Lee MC, Ko T, et al. Transfusion after total knee arthroplasty can be predicted using the machine learning algorithm. Knee Surg Sports Traumatol Arthrosc. 2020;28(6):1757–64.

    PubMed  Google Scholar 

  46. Jodeiri A, Zoroofi RA, Hiasa Y, Takao M, Sugano N, Sato Y, et al. Fully automatic estimation of pelvic sagittal inclination from anterior-posterior radiography image using deep learning framework. Comput Methods Prog Biomed. 2020;184:105282.

    Google Scholar 

Download references





Author information

Authors and Affiliations



AK Invitation, conceptualisation, drafting, writing, proofing, final approval. JI Drafting, writing, proofing. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Andrew P. Kurmis.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests and they were not involved in the journal’s review of or decisions related to, this manuscript.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Kurmis, A.P., Ianunzio, J.R. Artificial intelligence in orthopedic surgery: evolution, current state and future directions. Arthroplasty 4, 9 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Artificial intelligence
  • Arthroplasty
  • AI
  • Machine learning