Currently, a universally accepted definition of scientific breakthroughs remains elusive, as diverse interpretations and conceptualizations abound. Various scholars proposed different frameworks and principles to characterize this elusive phenomenon. Thomas Kuhn, for instance, posited that the progression of science entails a gradual accumulation of “normal” conditions until the emergence of an anomaly that defined explanation within the confines of the existing theory. This triggers a state of “revolution”, prompting the formulation of a new paradigm to accommodate the evolving scientific landscape. Hollingsworth (
2008) defined a breakthrough as “a finding or process, often preceded by numerous small advances, which leads to a new way of think about a problem…”. Winnink et al. (
2019) considered science breakthroughs to be events that have a major impact on future scientific research and are transformative points in science. Ponomarev et al. (
2014), in addition to describing the significant impact of breakthrough research on science, defined breakthrough research literature in terms of citations as literature that receives a high number of citations and that leads to a change in the direction of research. Building upon the articulated definition and features characterizing scientific breakthroughs, we posit that the developmental trajectories diverge between normal science and breakthrough scenarios. In the initial stages of scientific breakthroughs, anomalies exhibit a tendency to possess a low degree of similarity and relevance to the prevailing scientific paradigm, effectively assuming a status akin to “outliers.” Consequently, this study endeavors to identify potential scientific breakthroughs by scrutinizing anomalous semantic features, such as new theories, innovative approaches, and novel research questions within specific fields. This analytical perspective is framed within the context of outlier identification.