MCPcopy Index your code
hub / github.com/Turing-Project/WriteGPT / gen_bert_vector

Function gen_bert_vector

LanguageNetwork/BERT/sentence_encoder.py:65–77  ·  view source on GitHub ↗
(data, pad_size=200, )

Source from the content-addressed store, hash-verified

63
64
65def gen_bert_vector(data, pad_size=200, ):
66 model = Bert('./models/pytorch_pretrained_bert/bert_pretrain/', './temp/', load_pretrained_bert=True,
67 bert_config=None)
68 b_data = bert.pre_process(data, tgt=[list('NONE')], oracle_ids=[0], flag_i=0)
69 indexed_tokens, labels, segments_ids, cls_ids, src_txt, tgt_txt = b_data
70 sent_data = {"src": indexed_tokens, "segs": segments_ids}
71
72 src = torch.tensor(_pad([sent_data['src']], 0, pad_size)).to(device)
73 segs = torch.tensor(_pad([sent_data['segs']], 0, pad_size)).to(device)
74 mask = torch.logical_not(src == 0).to(device)
75 sentence_vector = model(src, segs, mask)
76
77 return sentence_vector
78
79
80def add_vector_in_origin_file(path, vector_dict, save_path):

Callers

nothing calls this directly

Calls 4

BertClass · 0.90
_padFunction · 0.85
modelFunction · 0.85
pre_processMethod · 0.45

Tested by

no test coverage detected