Professional Documents
Culture Documents
1
Agenda
Who are Magus?
What do we do?
Who do we do it for?
How do we do it?
SOA
Scalability
Storage
F/OSS
2
The Magus proposition
• Leading provider of innovative web-content
engineering
g g solutions to g
global corporations
p
• Specialise in managed applications that help
clients build value from their online assets and from
the wider web
• Three main applications:
ActiveStandards
RemoteSearch
CrucialInformation
• Delivering solutions since 1995
Our managed applications
z RemoteSearch a multi-site
multi site search engine,
engine providing integrated search
frameworks for enterprise websites
z CrucialInformation
C i lI f ti a premiumi currentt awareness service
i d delivering
li i
high-quality, strategic intelligence from the web and syndicated services
4
ActiveStandards
5
RemoteSearch
6
CrucialInformation
7
Social Networking
8
Our clients
9
Technically - where we were
1 product
• Web
W bd design
i bbusiness
i
• All home grown
• No appservers
• No failover
• No common
infrastructure
• Scalability worries
• No version
ersion control
• Unclear methodology
10
Technically – where we are now
• 3 main applications
pp
• Bespoke capability
• Common
infrastructure
• Platform of services
• Fault tolerant
• Scalable
• Defined process &
methodology
11
Approach
12
The components of the stack
• Trawl • Routing
• Harvest • Store
• Index • Quartz
• Search • ClientEngine
• Analysis • Profile
• Monitor • LinkChecker
13
Logical architecture
14
Trawl
15
Trawler service
Pluggable architecture based on JMX Mbean service
16
Harvest
17
Harvest service
Pluggable architecture based on JMX Mbean service
18
Index
19
Search
20
Analysis
21
Monitor
22
Classifier (monitor) service
Pluggable architecture based on JMX Mbean service
23
LinkChecker
• Responsible for checking the life of links and removing them correctly
from the system when they have expired
expired.
24
Routing
25
Content stores
26
Content stores - configuration
27
28
Store Interfaces
29
Store JMX Beans
30
Contentstore - engines
Can use many types of engine on a node
Currently supports:
Mysql
SleepyCat
Filesystem
32
Quartz
33
Client Engine
34
Management Application
zManage
g taxonomyy
zManage rules
zManage scheduling
zLeave
L service
i managementt tto JMX or web
b
consoles
zSwing
35
Management App
Management App
Management App
Management App
Profile
40
41
Infrastructure
architecture
hit t
42
Methodology
• Agility
g y – sprints
p
• Regular,
R l scheduled,
h d l d d
deployments
l t
43
Deployment
Deployment
1. C heck out
Subversion
(Code repository )
2. C ode /
Loc al T est
D eveloper
3. C hec k In
6 & 12 N otify
5 . Build / U nit T es ts /
Metr ic s
7 . Publish r esults
8. D eploy Bamboo
D ev C luster
19 . D eploy Applications
44
Throughput
• 11,000
, sources in system
y
• ~16
16,000,000
000 000 pages rolling store
• ~200,000
200 000 new pages per day
d
45
Cost comparisons
• Apples
pp and oranges?
g
Proprietary Licence Free
Product Per CPU CPUs Total Product Per CPU CPUs Total
O l
Oracle 20 000 00
20,000.00 10 $200 000
$200,000 M S l
MySql $0 00
$0.00 10 $0.00
$0 00
Weblogic AS 10,000.00 38 $380,000 Jboss AS $0.00 38 $0.00
MS Windows Server 3,919.00 48 $188,112 Redhat/Apa $0.00 48 $0.00
Visual Team Studio 1,000.00 12 $12,000 Eclipse $0.00 12 $0.00
ClearCase 4,125.00 1 $4,125 Subversion $0.00 1 $0.00
Jira 2 000 00
2,000.00 1 $2 000
$2,000 Trac $0 00
$0.00 1 $0 00
$0.00
Autonomy IDOL bundl 75,000.00 2 $150,000 Carrot2 $0.00 12 $0.00
IBM Intelligent Datami 132,000.00 1 $132,000 LingPipe $0.00 12 $0.00
Verity K2 50,000.00 2 $100,000 Lucene $0.00 8 $0.00
UIMA $0.00 12 $0.00
$1,068,237 $0.00
£580,531.26
€849,629.77
46
Thank you
Questions?
47