Amundsen

A data discovery and metadata engine for improving productivity when interacting with data

Amundsen is a data discovery and metadata engine that improves the productivity of data analysts, data scientists, and engineers when they interact with data. It currently does this by indexing data resources (tables, dashboards, streams, and so on) and providing PageRank-style search based on usage patterns (for example, heavily queried tables appear before rarely queried ones). Think of it as Google search for data. The project is named after the Norwegian explorer Roald Amundsen, the first person to reach the South Pole.

Installation

Bootstrap a default version of Amundsen using Docker

The following instructions are for setting up a version of Amundsen using Docker.

  1. Make sure you have at least 3GB of disk space available to Docker. Install docker and docker-compose.

  2. Clone this repo and its submodules by running:

    $ git clone --recursive https://github.com/amundsen-io/amundsen.git
  3. Enter the cloned directory and run the command below:

    # For Neo4j Backend
    $ docker-compose -f docker-amundsen.yml up
    
    # For Atlas
    $ docker-compose -f docker-amundsen-atlas.yml up

    If it's your first time, you may want to proactively go through the troubleshooting steps, especially the first one, which covers heap memory for Elasticsearch and Docker engine memory allocation (the cause of Docker error 137).

  4. Ingest the provided sample data into Neo4j as follows (skip this step if you are using the Atlas backend):

  5. In a separate terminal window, change directory to databuilder.

  6. The sample_data_loader Python script in the example/ directory uses the Elasticsearch client, pyhocon, and other libraries. Install the dependencies in a virtual environment and run the script with the commands below. See Windows Troubleshooting if python3 setup.py install fails with an error about extras_require on Windows.

     $ python3 -m venv venv
     $ source venv/bin/activate
     $ pip3 install --upgrade pip
     $ pip3 install -r requirements.txt
     $ python3 setup.py install
     $ python3 example/scripts/sample_data_loader.py
  7. View the UI at http://localhost:5000 and search for test; it should return some results.

  8. You can also run an exact-match search on a table entity. For example, search for test_table1 in the table field and it will return the matching records.
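
Before opening the UI, it can help to wait until the frontend actually answers on port 5000. The snippet below is a minimal sketch of such a check, not something shipped with Amundsen: wait_for_http is a hypothetical helper name, and it assumes curl is installed on the host.

```shell
# Hypothetical helper (not part of Amundsen): poll a URL until it responds.
# Usage: wait_for_http URL [ATTEMPTS] [DELAY_SECONDS]
wait_for_http() {
  url=$1
  attempts=${2:-30}   # default: try 30 times
  delay=${3:-2}       # default: wait 2 seconds between tries
  i=1
  while [ "$i" -le "$attempts" ]; do
    # -s: silent, -f: treat HTTP errors as failures,
    # -o /dev/null: discard the body, --max-time: give up on a hung request
    if curl -sf -o /dev/null --max-time 5 "$url"; then
      return 0
    fi
    # Sleep only if another attempt is still to come
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    i=$((i + 1))
  done
  return 1
}

# Example (run manually once the containers are up):
#   wait_for_http http://localhost:5000 && echo "Amundsen UI is reachable"
```

The function returns 0 on the first successful response and 1 if every attempt fails, so it can be chained with && in a setup script.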

Atlas Note: Atlas takes some time to boot properly, so you may not see results immediately after running the docker-compose up command. Atlas is ready once the following message appears in the Docker output: Amundsen Entity Definitions Created...
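
Watching for that message by hand can be replaced with a small check on the container logs. This is only a sketch: atlas_ready is a hypothetical helper name, and the polling loop in the comment assumes the compose file name used in the steps above.

```shell
# Hypothetical helper (not part of Amundsen): read log lines on stdin and
# succeed if the Atlas readiness message appears anywhere in them.
atlas_ready() {
  grep -q 'Amundsen Entity Definitions Created'
}

# Example (run manually from the cloned directory): re-check the logs
# every 10 seconds until the readiness message shows up.
#   until docker-compose -f docker-amundsen-atlas.yml logs 2>&1 | atlas_ready; do
#     sleep 10
#   done
#   echo "Atlas is ready"
```

The helper only inspects whatever text it is given, so it works equally well on a saved log file (atlas_ready < atlas.log).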

References

Installation example blog post 👍
