Proceedings of the IV International research conference "Information technologies in Science, Management, Social sphere and Medicine" (ITSMSSM 2017)

Parsing of Data on Real Estate Objects from Network Resource

Authors
Vyacheslav Cherkesov, Vitaliy Malikov, Alexey Golubev, Danila Parygin, Tatiana Smykovskaya
Corresponding Author
Vyacheslav Cherkesov
Available Online December 2017.
DOI
10.2991/itsmssm-17.2017.80How to use a DOI?
Keywords
real estate object, network resource, parsing, data collection, BeautifulSoup, Scrapy
Abstract

Existing approaches for collecting data from sites on the Internet were considered. A comparative analysis of the solution based on the BeautifulSoup library and the Scrapy framework for parsing the content of network resources was made. Sources of information about real estate objects were analyzed. The method for parsing data on real estate objects was developed based on the results of the conducted studies. In addition, the main problems with the use of parsing technology were identified.

Copyright
© 2017, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the IV International research conference "Information technologies in Science, Management, Social sphere and Medicine" (ITSMSSM 2017)
Series
Advances in Computer Science Research
Publication Date
December 2017
ISBN
978-94-6252-432-3
ISSN
2352-538X
DOI
10.2991/itsmssm-17.2017.80How to use a DOI?
Copyright
© 2017, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Vyacheslav Cherkesov
AU  - Vitaliy Malikov
AU  - Alexey Golubev
AU  - Danila Parygin
AU  - Tatiana Smykovskaya
PY  - 2017/12
DA  - 2017/12
TI  - Parsing of Data on Real Estate Objects from Network Resource
BT  - Proceedings of the IV International research conference "Information technologies in Science, Management, Social sphere and Medicine" (ITSMSSM 2017)
PB  - Atlantis Press
SP  - 385
EP  - 388
SN  - 2352-538X
UR  - https://doi.org/10.2991/itsmssm-17.2017.80
DO  - 10.2991/itsmssm-17.2017.80
ID  - Cherkesov2017/12
ER  -