1 Introduction
2 Preliminary knowledge
3 Method
3.1 Factors in computer science
3.2 Data preprocessing
Table 1. Discretization rules of factors (Sun et al., 2023). |
Variable | Discretization rule |
---|---|
pNov | 0: zero; [0, 0.4]: low; (0.4, 0.6]; median; (0.6, 0.8]: mhigh; (0.8, 1]: high |
pDisrupt | <0: ngtzero; sort pDisrupt values and divide by top percentage interval: (70%, 100%]: low; (30%, 70%]: medium; (10%, 30%]: mhigh; (0%, 10%]: high |
refNum | [0, 10]: ltTen; (10, 20]: tenTwenty; (20, 30]: twentyThirty; > 30: gtThirty |
abRE | >70: easy; (50, 70]: medium; (40, 50]: mhard; (30, 40]: hard; <30: vhard (ref. Flesch, 1948) |
abLen | <600: short; (600, 800]; median; (800, 1000]: long; >1000: vlong |
pNumF | [0, 10]: ltTen; (10, 20]: tenTwenty; (20, 50]: twentyFifty; (50, 100]: FiftyHundred; > 100: gtHundred |
pNumM | |
tcF | [0, 10]: ltTen; (10, 100]: tenHundred; (100, 500]: HundredFiveH; (500, 2000]: fiveHTwentyH; (2000, 10000]: twentyHHundredH; > 10000: gtHundredH |
tcM | |
HIF | [0, 10]: ltTen; (10, 20]: tenTwenty; (20, 30]: twentyThirty; (30, 50]: thirtyFifty; (50, 100]: FiftyHundred |
HIM | |
auCDF | sort auCDF/auCDM values and divide by top percentage interval: (50%, 100%]: low; (20%, 50%]: mlow; (10%, 20%]: medium; (5%, 10%]: mhigh; (0%, 5%]: high |
auCDM | |
instCDF | |
instCDM | |
auNum | 1: one; 2: tow; 3: three; 4: four; 5: five; >5: gtfive |
instNum | |
CNCI | (0, 0.3]: low; [0.3, 0.8]: mlow; (0.8, 1.2]; average; (1.2, 2]: mhigh; (2, 5]: vhigh; >5: exhigh |
3.3 Bayesian network construction
Figure 1. The learned Bayesian network. |
4 Findings
Figure 2. An example of Bayesian network inference by setting Category as journal. |
Figure 3. The distribution of Category by setting various HIM and pNumM and auCDM. |
Figure 4. The distribution of Category by setting pNumM=(FiftyHundred, gtHundred), HIM=(FiftyHundred, gtHundred), auCDM=(mhigh,high). |
Figure 5. mhigh or above CNCI probability by setting various Category and Rank.(a) mhigh or above CNCI probability by setting various category (b) mhigh or above CNCI probability by setting various rank and category |
Figure 6. Distribution of refNum/abLen by setting various Category.(a) Distribution of abLen by setting various category (b) Distribution of refNum by setting various category |
Figure 7. mhigh or above pNov/pDisrupt probability by setting various Category. |
Figure 8. mhigh or above pNov/pDisrupt probability by setting various Category and Rank.(a) the mhigh or high probability pNov by setting various category and rank various category and rank (b) the mhigh or high probability pDisrupt by setting various category and rank |