这个 csv 的头
Name,CardNo,Descriot,CtfTp,CtfId,Gender,Birthday,Address,Zip,Dirty,District1,District2,District3,District4,District5,District6,FirstNm,LastNm,Duty,Mobile,Tel,Fax,EMail,Nation,Taste,Education,Company,CTel,CAddress,CZip,Family,Version,id
陈xx,,,OTH,010-116321,M,19000101,北京市海淀区xxx,100080, ,,CHN,0,0,,,,,,10116,010-xx,010-xx-208,[email protected],,,,,,,,0,2012-12-23 11:13:38,2
赵xx,,,ID,21010219880204xxxx,M,1988xxx,-,-, ,,CHN,21,210102,,,,,,186024xxx,-,-,[email protected],,,,,,,,0,2011-3-22 17:35:12,100
比如统计最白菜的名字,或邮箱后缀等
使用的源文件 https://www.copy.com/s/UyiPPS4mZtnZ/Public/shifenzheng.csv.zip
另外问个问题,为什么用 sort 和 uniq 命令排序统计这么省内存,如果用 ruby 实现这样省内存的排序统计