Jupyter Notebook 中遍历唯一值及计数的代码解析

这段代码的功能是：统计数据框 df_A 中每个 'unit' 对应的 'cycle' 的唯一值数量。

让我们逐步解析这段代码：

for i in np.unique(df_A['unit']):
- df_A['unit']: 从数据框 df_A 中提取名为 'unit' 的列。
- np.unique(): 使用 NumPy 库中的 unique 函数获取 'unit' 列中所有不同的值。
- for i in ...: 遍历 'unit' 列的每一个唯一值，并将当前值赋值给变量 i。
print('Unit: ' + str(i) + ' - Number of flight cyles (t_{EOF}): ', ...)
- 这段代码用于输出结果，其中：
  - 'Unit: ' 和 ' - Number of flight cyles (t_{EOF}): ' 是固定的字符串。
  - str(i) 将当前 'unit' 值 i 转换为字符串类型。
  - 接下来计算并输出该 'unit' 对应的 'cycle' 唯一值数量。
len(np.unique(df_A.loc[df_A['unit'] == i, 'cycle']))
- 这段代码计算 'cycle' 唯一值数量：
  - df_A.loc[df_A['unit'] == i, 'cycle']: 筛选出 df_A 中 'unit' 列等于当前值 i 的所有行，并提取 'cycle' 列。
  - np.unique(...): 获取筛选出的 'cycle' 列中的唯一值。
  - len(...): 计算唯一值的个数。

总结: 这段代码简洁地实现了对数据框特定列进行分组，并统计每组另一列的唯一值数量的功能。通过使用 np.unique 和 df.loc，我们可以高效地完成这项任务。