MCPcopy
hub / github.com/chrislusf/glow

github.com/chrislusf/glow @main sqlite

repository ↗ · DeepWiki ↗
623 symbols 1,627 edges 102 files 73 documented · 12%
README

glow

Build Status GoDoc

Purpose

Glow is providing a library to easily compute in parallel threads or distributed to clusters of machines. This is written in pure Go.

I am also working on another pure-Go system, https://github.com/chrislusf/gleam , which is more flexible and more performant.

Installation

$ go get github.com/chrislusf/glow
$ go get github.com/chrislusf/glow/flow

One minute tutorial

Simple Start

Here is a simple full example:

package main

import (
    "flag"
    "strings"

    "github.com/chrislusf/glow/flow"
)

func main() {
    flag.Parse()

    flow.New().TextFile(
        "/etc/passwd", 3,
    ).Filter(func(line string) bool {
        return !strings.HasPrefix(line, "#")
    }).Map(func(line string, ch chan string) {
        for _, token := range strings.Split(line, ":") {
            ch <- token
        }
    }).Map(func(key string) int {
        return 1
    }).Reduce(func(x int, y int) int {
        return x + y
    }).Map(func(x int) {
        println("count:", x)
    }).Run()
}

Try it.

  $ ./word_count

It will run the input text file, '/etc/passwd', in 3 go routines, filter/map/map, and then reduced to one number in one goroutine (not exactly one goroutine, but let's skip the details for now.) and print it out.

This is useful already, saving lots of idiomatic but repetitive code on channels, sync wait, etc, to fully utilize more CPU cores.

However, there is one more thing! It can run across a Glow cluster, which can be run multiple servers/racks/data centers!

Scale it out

To setup the Glow cluster, we do not need experts on Zookeeper/HDFS/Mesos/YARN etc. Just build or download one binary file.

Setup the cluster

  # Fetch and install via go, or just download it from somewhere.
  $ go get github.com/chrislusf/glow
  # Run a script from the root directory of the repo to start a test cluster.
  $ etc/start_local_glow_cluster.sh

Glow Master and Glow Agent run very efficiently. They take about 6.5MB and 5.5MB memory respectively in my environments. I would recommend set up agents on any server you can find. You can tap into the computing power whenever you need to.

Start the driver program

To leap from one computer to clusters of computers, add this line to the import list:

    _ "github.com/chrislusf/glow/driver"

And put this line as the first statement in the main() function:

    flag.Parse()

This will "steroidize" the code to run in cluster mode!

$ ./word_count -glow -glow.leader="localhost:8930"

The word_count program will become a driver program, dividing the execution into a directed acyclic graph(DAG), and send tasks to agents.

Visualize the flow

To understand how each executor works, you can visualize the flow by generating a dot file of the flow, and render it to png file via "dot" command provided from graphviz.

$ ./word_count -glow -glow.flow.plot > x.dot
$ dot -Tpng -otestSelfJoin.png x.dot

Glow Hello World Execution Plan

Read More

  1. Wiki page: https://github.com/chrislusf/glow/wiki
  2. Mailing list: https://groups.google.com/forum/#!forum/glow-user-discussion
  3. Examples: https://github.com/chrislusf/glow/tree/master/examples/

Docker container

Docker is not required. But if you like docker, here are instructions.

# Cross compile artefact for docker
$ GOOS=linux GOARCH=amd64 CGO_ENABLED=0 go build .
# build container
$ docker build -t glow .

See examples/ directory for docker-compose setups.

Contribution

Start using it! And report or fix any issue you have seen, add any feature you want.

Fork it, code it, and send pull requests. Better first discuss about the feature you want on the mailing list. https://groups.google.com/forum/#!forum/glow-user-discussion

License

http://www.apache.org/licenses/LICENSE-2.0

Extension points exported contracts — how you extend this code

DataStore (Interface)
(no doc) [2 implementers]
netchan/store/data_store.go
ContextRunner (Interface)
(no doc) [1 implementers]
flow/context_run.go
Object (Interface)
(no doc)
driver/scheduler/market/cda_market.go
TaskRunner (Interface)
(no doc) [1 implementers]
flow/context_run.go
Requirement (Interface)
(no doc)
driver/scheduler/market/cda_market.go

Core symbols most depended-on inside this repo

Map
called by 46
flow/dataset_map.go
w
called by 36
driver/plan/group_plot.go
Flag
called by 31
util/read_write.go
Close
called by 30
netchan/store/rotating_file_store.go
String
called by 28
resource/resource.go
println
called by 21
driver/plan/group_plot.go
Name
called by 19
flow/step_task.go
URL
called by 17
resource/resource.go

Shape

Method 334
Function 188
Struct 89
TypeAlias 7
Interface 5

Languages

Go100%

Modules by API surface

driver/cmd/control_message.pb.go111 symbols
netchan/store/rotating_file_store.go27 symbols
resource/topology.go18 symbols
resource/resource.go18 symbols
flow/context.go14 symbols
driver/scheduler/market/cda_market.go14 symbols
driver/plan/group.go14 symbols
flow/context_run.go13 symbols
resource/service_discovery/master/master_allocation.go12 symbols
driver/plan/group_plot.go12 symbols
driver/context_driver.go12 symbols
util/read_write.go11 symbols

Datastores touched

(mongodb)Database · 1 repos

For agents

$ claude mcp add glow \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact