Matches in SemOpenAlex for { <https://semopenalex.org/work/W3039539959> ?p ?o ?g. }
- W3039539959 abstract "E-commerce customers in developing nations like India tend to follow no fixed format while entering shipping addresses. Parsing such addresses is challenging because of a lack of inherent structure or hierarchy. It is imperative to understand the language of addresses, so that shipments can be routed without delays. In this paper, we propose a novel approach towards understanding customer addresses by deriving motivation from recent advances in Natural Language Processing (NLP). We also formulate different pre-processing steps for addresses using a combination of edit distance and phonetic algorithms. Then we approach the task of creating vector representations for addresses using Word2Vec with TF-IDF, Bi-LSTM and BERT based approaches. We compare these approaches with respect to sub-region classification task for North and South Indian cities. Through experiments, we demonstrate the effectiveness of generalized RoBERTa model, pre-trained over a large address corpus for language modelling task. Our proposed RoBERTa model achieves a classification accuracy of around 90% with minimal text preprocessing for sub-region classification task outperforming all other approaches. Once pre-trained, the RoBERTa model can be fine-tuned for various downstream tasks in supply chain like pincode suggestion and geo-coding. The model generalizes well for such tasks even with limited labelled data. To the best of our knowledge, this is the first of its kind research proposing a novel approach of understanding customer addresses in e-commerce domain by pre-training language models and fine-tuning them for different purposes." @default.
- W3039539959 created "2020-07-10" @default.
- W3039539959 creator A5009410751 @default.
- W3039539959 creator A5009686735 @default.
- W3039539959 creator A5051582939 @default.
- W3039539959 date "2020-07-06" @default.
- W3039539959 modified "2023-09-27" @default.
- W3039539959 title "Deep Contextual Embeddings for Address Classification in E-commerce." @default.
- W3039539959 cites W135751440 @default.
- W3039539959 cites W1522301498 @default.
- W3039539959 cites W1647671624 @default.
- W3039539959 cites W168564468 @default.
- W3039539959 cites W1816313093 @default.
- W3039539959 cites W1956559956 @default.
- W3039539959 cites W1986703303 @default.
- W3039539959 cites W1998473728 @default.
- W3039539959 cites W2064675550 @default.
- W3039539959 cites W2101234009 @default.
- W3039539959 cites W2236139323 @default.
- W3039539959 cites W2427527485 @default.
- W3039539959 cites W2606964149 @default.
- W3039539959 cites W2626778328 @default.
- W3039539959 cites W2763768291 @default.
- W3039539959 cites W2784121710 @default.
- W3039539959 cites W2885185669 @default.
- W3039539959 cites W2896457183 @default.
- W3039539959 cites W2914481865 @default.
- W3039539959 cites W2920004210 @default.
- W3039539959 cites W2950133940 @default.
- W3039539959 cites W2950541952 @default.
- W3039539959 cites W2950784811 @default.
- W3039539959 cites W2962739339 @default.
- W3039539959 cites W2963310665 @default.
- W3039539959 cites W2965373594 @default.
- W3039539959 cites W2970971581 @default.
- W3039539959 cites W2978017171 @default.
- W3039539959 cites W2980282514 @default.
- W3039539959 cites W2989321829 @default.
- W3039539959 cites W3021027690 @default.
- W3039539959 cites W2995035536 @default.
- W3039539959 hasPublicationYear "2020" @default.
- W3039539959 type Work @default.
- W3039539959 sameAs 3039539959 @default.
- W3039539959 citedByCount "0" @default.
- W3039539959 crossrefType "posted-content" @default.
- W3039539959 hasAuthorship W3039539959A5009410751 @default.
- W3039539959 hasAuthorship W3039539959A5009686735 @default.
- W3039539959 hasAuthorship W3039539959A5051582939 @default.
- W3039539959 hasConcept C105795698 @default.
- W3039539959 hasConcept C119857082 @default.
- W3039539959 hasConcept C134306372 @default.
- W3039539959 hasConcept C137293760 @default.
- W3039539959 hasConcept C154945302 @default.
- W3039539959 hasConcept C162324750 @default.
- W3039539959 hasConcept C179518139 @default.
- W3039539959 hasConcept C186644900 @default.
- W3039539959 hasConcept C187736073 @default.
- W3039539959 hasConcept C204321447 @default.
- W3039539959 hasConcept C2776461190 @default.
- W3039539959 hasConcept C2780451532 @default.
- W3039539959 hasConcept C33923547 @default.
- W3039539959 hasConcept C34736171 @default.
- W3039539959 hasConcept C36503486 @default.
- W3039539959 hasConcept C41008148 @default.
- W3039539959 hasConcept C41608201 @default.
- W3039539959 hasConceptScore W3039539959C105795698 @default.
- W3039539959 hasConceptScore W3039539959C119857082 @default.
- W3039539959 hasConceptScore W3039539959C134306372 @default.
- W3039539959 hasConceptScore W3039539959C137293760 @default.
- W3039539959 hasConceptScore W3039539959C154945302 @default.
- W3039539959 hasConceptScore W3039539959C162324750 @default.
- W3039539959 hasConceptScore W3039539959C179518139 @default.
- W3039539959 hasConceptScore W3039539959C186644900 @default.
- W3039539959 hasConceptScore W3039539959C187736073 @default.
- W3039539959 hasConceptScore W3039539959C204321447 @default.
- W3039539959 hasConceptScore W3039539959C2776461190 @default.
- W3039539959 hasConceptScore W3039539959C2780451532 @default.
- W3039539959 hasConceptScore W3039539959C33923547 @default.
- W3039539959 hasConceptScore W3039539959C34736171 @default.
- W3039539959 hasConceptScore W3039539959C36503486 @default.
- W3039539959 hasConceptScore W3039539959C41008148 @default.
- W3039539959 hasConceptScore W3039539959C41608201 @default.
- W3039539959 hasLocation W30395399591 @default.
- W3039539959 hasOpenAccess W3039539959 @default.
- W3039539959 hasPrimaryLocation W30395399591 @default.
- W3039539959 hasRelatedWork W1967503395 @default.
- W3039539959 hasRelatedWork W2099031744 @default.
- W3039539959 hasRelatedWork W2107251604 @default.
- W3039539959 hasRelatedWork W2494546154 @default.
- W3039539959 hasRelatedWork W2910612103 @default.
- W3039539959 hasRelatedWork W2981852735 @default.
- W3039539959 hasRelatedWork W2981916418 @default.
- W3039539959 hasRelatedWork W2997129576 @default.
- W3039539959 hasRelatedWork W3003127496 @default.
- W3039539959 hasRelatedWork W3016913119 @default.
- W3039539959 hasRelatedWork W3021578384 @default.
- W3039539959 hasRelatedWork W3045573124 @default.
- W3039539959 hasRelatedWork W3093255796 @default.
- W3039539959 hasRelatedWork W3131603048 @default.
- W3039539959 hasRelatedWork W3134311017 @default.