A股申万一级分类共31个sw1_770000,美容护理sw1_760000,环保sw1_750000,石油石化sw1_740000,煤炭sw1_730000,通信sw1_720000,传媒sw1_710000,计算机sw1_650000,国防军工sw1_640000,机械设备sw1_630000,电力设备sw1_620000,建筑装饰sw1_610000,建筑材料sw1_510000,综合sw1_490000,非银金融sw1_480000,银行sw1_460000,社会服务sw1_450000,商贸零售sw1_430000,房地产sw1_420000,交通运输sw1_410000,公用事业sw1_370000,医药生物sw1_360000,轻工制造sw1_350000,纺织服饰sw1_340000,食品饮料sw1_330000,家用电器sw1_280000,汽车sw1_270000,电子sw1_240000,有色金属sw1_230000,钢铁sw1_220000,基础化工sw1_110000,农林牧渔
(申万一级分类信息从新浪一级分类信息,完整python文章《板块分析1/2》 – 如何获取a股申万分类信息(包括完整性)python点击此链接查看代码)
A股申万二级分类共131个sw2_730200,通信设备sw2_730100,通信服务sw2_721000,电视广播Ⅱsw2_720900,出版sw2_720700,数字媒体sw2_720600,影视院线sw2_720500,广告营销sw2_720400,游戏Ⅱsw2_710400,软件开发sw2_710300,IT服务Ⅱsw2_计算机设备710100sw2_650500,军工电子Ⅱsw2_650400,航海装备Ⅱsw2_650300,地面兵装Ⅱsw2_650200,航空装备Ⅱsw2_650100,航天装备Ⅱsw2_自动化设备640700sw2_640600,工程机械sw2_640500,轨交设备Ⅱsw2_640200,专用设备sw2_640100,通用设备sw2_630800,电网设备sw2_630700,电池sw2_630600,风电设备sw2_630500,光伏设备sw2_其他电源设备630300Ⅱsw2_630100,电机Ⅱsw2_工程咨询服务620600Ⅱsw2_620400,专业工程sw2_620300,基础建设sw2_620200,装修装饰Ⅱsw2_620100,房屋建设Ⅱsw2_610300,装修建材sw2_610200,玻璃玻纤sw2_610100,水泥sw2_510100,综合Ⅱsw2_490300,多元金融sw2_490200,保险Ⅱsw2_490100,证券Ⅱsw2_480500,农商行Ⅱsw2_480400,城商行Ⅱsw2_股份制银行480300Ⅱsw2_国有大银行480200Ⅱsw2_461100,教育sw2_461000,旅游及景区sw2_460900,酒店餐饮sw2_460800,专业服务sw2_460600,体育Ⅱsw2_450700,旅游零售Ⅱsw2_互联网电商450600sw2_450400,专业连锁Ⅱsw2_450300,一般零售sw2_450200,贸易Ⅱsw2_430300,房地产服务sw2_房地产开发430100sw2_421100,航运港口sw2_421000,航空机场sw2_420900,铁路公路sw2_420800,物流sw2_410300,燃气Ⅱsw2_410100,电力sw2_370600,医疗服务sw2_370500,医疗器械sw2_370400,医药商业sw2_370300,生物制品sw2_370200,中药Ⅱsw2_370100,化学制药sw2_360500,文娱用品sw2_360300,家居用品sw2_360200,包装印刷sw2_360100,造纸sw2_350300,饰品sw2_350200,服装家纺sw2_350100,纺织制造sw2_调味发酵340900Ⅱsw2_340800,休闲食品sw2_340700,饮料乳品sw2_340600,非白酒sw2_340500,白酒Ⅱsw2_340400,食品加工sw2_330700,其他家电Ⅱsw2_330600,家电零部件Ⅱsw2_330500,照明设备Ⅱsw2_330400,厨卫电器sw2_330300,小家电sw2_330200,黑色家电sw2_330100,白色家电sw2_280600,商用车sw2_280500,乘用车sw2_摩托车等sw2_280300,汽车服务sw2_280200,汽车零部件sw2_电子化学品270600Ⅱsw2_270500,消费电子sw2_270400,其他电子Ⅱsw2_光学光电子270300sw2_270200,元件sw2_270100,半导体sw2_240600,能源金属sw2_240500,小金属sw2_240400,贵金属sw2_240300,工业金属sw2_金属新材料240200sw2_230500,特钢Ⅱsw2_230400,普钢sw2_230300,冶钢原料sw2_非金属材料220900Ⅱsw2_220800,农化制品sw2_220600,橡胶sw2_220500,塑料sw2_220400,化学纤维sw2_220300,化学制品sw2_220200,化学原料sw2_110900,农业综合Ⅱsw2_110800,动物保健Ⅱsw2_110700,养殖业sw2_农产品加工110500sw2_110400,饲料sw2_110300,林业Ⅱsw2_110200,渔业sw2_110100,种植业sw2_770300,医疗美容sw2_770200,化妆品sw2_770100,个护用品sw2_760200,环保设备Ⅱsw2_760100,环境治理sw2_750300,炼化和贸易sw2_750200,油服工程sw2_750100,油气开采Ⅱsw2_740200,焦炭Ⅱsw2_740100,煤炭开采抓取某浪财经网页申万二级分类信息python完整代码
import requestsfrom bs4 import BeautifulSoupimport refrom operator import itemgetterimport timeimport randomimport pandas as pd\def remove_col(arr, ith): itg = itemgetter(*filter((ith).__ne__, range(len(arr)))) return list(map(list, map(itg, arr))) rl = 'http://vip.stock.finance.sina.com.cn/quotes_service/api/json_v2.php/Market_Center.getHQNodes'# http://vip.stock.finance.sina.com.cn/quotes_service/api/json_v2.php/Market_Center.getHQNodes\heads = {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/89.0.4389.90 Safari/537.36"}# url = 'https://www.holidayfrancedirect.co.uk/cottages-holidays/index.htm'esText = requests.get(url)\soup = BeautifulSoup(resText.content, features='lxml') \s = soup.textprint('\申万二级分类:') shw2 = s[s.find('sw1_hy'):s.find('sw2_hy')]shw2_cut = shw2[shw2.find('[['):shw2.find(']]')]shw2_cut = re.sub(r'\\[','',shw2_cut)shw2_list = shw2_cut.split(']')shw2_list_split = []for i in range(0,len(shw2_list)): # len(shw2_list) item_split = shw2_list[i].split(',') if i == 0: temp_str = item_split[0].encode('utf-8').decode('unicode_escape') # print(i,temp_str) item_split[0] = temp_str else: temp_str = item_split[1].encode('utf-8').decode('unicode_escape') # print(i,temp_str) item_split[1] = temp_str item_split = item_split[1:4] \ shw2_list_split.append(item_split) result_shw2 = remove_col(shw2_list_split, 1)print()print(';申万二级分类总数:',len(result_shw2))print(';申万二级分类总数(包括各类股票):',len(shw2_category_and_stocks))\# 把信息写进去csv文件shw2_category = [x[0][0] for x in shw2_category_and_stocks] shw2_code = [x[0][1] for x in shw2_category_and_stocks] dict1 = {'shw2_code': shw2_code,'shw2_category': shw2_category} df1 = pd.DataFrame(dict1) df1.to_csv('shenwan2_category.csv',index = False)\shw2_category_code = [x[1] for x in shw2_categorystock] shw2_category_name = [x[0] for x in shw2_categorystock] shw2_category_mktcode = [x[2] for x in shw2_categorystock] shw2_stock_code = [x[3] for x in shw2_categorystock] shw2_stock_name = [x[4] for x in shw2_categorystock] stock_mktcap = [x[5] for x in shw2_categorystock] nstock_nmc = [x[6] for x in shw2_categorystock] ndict2 = {'shw2_code': shw2_category_code,'category_name': shw2_category_name,'category_mktcode':shw2_category_mktcode, 'stock_code':shw2_stock_code,'stock_name':shw2_stock_name,'stock_mktcap':stock_mktcap,'stock_nmc':stock_nmc} # ndf2 = pd.DataFrame(dict2) ndf2.to_csv('shenwan2_category_stocks.csv',index = False)
shenwan2_category_stocks.csv 格式如下图:
其中每行内容说明如下:
表头
说明
示例
shw2_code
申万二级分类代码
sw2_730200
category_name
申万二级分类名称
通信设备
category_mktcode
申万二级分类市场代码
bj831961
stock_code
股票代码
831961
stock_name
股票名称
创远信科
stock_mktcap
股票总市值
163838.06
stock_nmc
股票流通市值
94781.02
抓取某浪财经A股申万一级分类信息的完整python,在文章《板块分析1/2 – 如何获取A股个股的申万分类信息(含完整python代码)》中,点击这里查看。
(本文完)
版权声明:本文内容由互联网用户自发贡献,该文观点仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容, 请发送邮件至827202335@qq.com 举报,一经查实,本站将立刻删除。文章链接:https://www.eztwang.com/dongtai/66819.html