Find Jobs
Hire Freelancers

Build a Website -- 2

$250-750 USD

Cancelado
Publicado hace más de 7 años

$250-750 USD

Pagado a la entrega
Hi there. I need someone to write a program and auto catch data from this site : [login to view URL] Not all the data. Look at the category list in the right site first. I need atuo catch some categories' data. Some of them are under [恋愛・少女漫画], from [ちはやふる] to [すきっていいなよ。]. Rest fo them are under [ファンタジー漫画], from [終りのセラフ] to [クレイモア]. You have to catch data in the way I need. Data Structure id----order url----URL of the article cat----category title----title magazine----magazine author----author genre----genre character----character site----goal website article----the text entry_data_at----publish time created_at----catch time picture----the cover the article *Check tip1,tip2,tip3 [cat],means the name of [login to view URL] the category list in the right [login to view URL] can see words like [ちはやふる] and [黒崎くんの言いなりになんてならない],they are categorise. [title],check the category [ちはやふる],turns to a new page,you can see words such as [ちはやふる33巻173首のネタバレ感想] or [ちはやふる33巻172首のネタバレ感想], they are titles. [article],check one title like [ちはやふる33巻173首のネタバレ感想],turns to a new page,you can see an article with lot of [login to view URL] have to catch the body which from the title(ちはやふる33巻173首のネタバレ感想) to the end of the article (end at the place above [目次][コメント] and advertisements). [entry_data_at],means the publish time of the articel,for example,the publish time of ちはやふる33巻173首のネタバレ感想 is the one written under the title - 2016/10/[login to view URL] have to record it by using timestamp,which would turn 2016/10/01 into 1451577600. [url],means the url of the article,like [login to view URL] [site],all write as [login to view URL] [character],for example,[login to view URL],under advertisements,there is a [目次] [login to view URL] can see [33巻173首] write in black and has no [login to view URL]'s the [character] About [author],[magazine],[genre],[picutre],[id] and [created_at],should do the following step first. Search [cat] in [login to view URL],use the first result. For example,search [ちはやふる] in [login to view URL],you can get: 作家:末次由紀 雑誌・レーベル: BE・LOVE ジャンル: スポーツ / 少女マンガ / アニメ化 / 映画化 So, [author],means the words after [作家:]. In the example the [author] is [末次由紀]. [magazine],means the words after [雑誌・レーベル:], In the example the [magazine] is [BE・LOVE]. [genre],means the words after [genre:],need to use "," to separate them. In the example the [genre] is [スポーツ,少女マンガ,アニメ化,映画化]. [pitucre],the cover of the first [login to view URL] have to catch covers and store [login to view URL] the datebase there should add a data bar of [pictuer] and have url of each cover. [id],means the order, the first one is 1, the second one is 2, etc. [created_at],means the time you catch the article,also have to record by using timestamp. For example,if I catch the date on UTC/GMT+08:00 2016/10/11 14:40:30, so the [created_at] should be 1476168030. Use [ちはやふる] as the example, do what I said,you can get: id:1 url:[login to view URL] cat:ちはやふる title:ちはやふる33巻173首のネタバレ感想 magazine:BE・LOVE author:末次由紀 genre:スポーツ,少女マンガ,アニメ化,映画化 character:33巻173首 site:[login to view URL] article:<h1 class="entry-title">.......... picture: ... entry_data_at:1451577600 created_at:1476168030 *Check the datebase sample photo This is what I [login to view URL] have to do in this way to make my server can recognize the data. Need to catch data 2 hours one [login to view URL] to send me the program you write to catch data. Please tab 1234 in your bid.
ID del proyecto: 11775055

Información sobre el proyecto

Proyecto remoto
Activo hace 7 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos

Sobre este cliente

Bandera de CHINA
kojima, China
5,0
25
Forma de pago verificada
Miembro desde may 5, 2016

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.